per 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 5 : 
O07K 7/10, A61K 39/21 
C12N 15/48, C12P 21/02, 21/08 
G01N 33/569 



Al 



(U) International Publication Niraber: 
(43) International Publication Date: 



WO 90/03984 

19 April 1990(19.04.90) 



(21) International Application Number : PCT/US89/04302 

(22) International Filing Date: 29 September 1989 (29.09.89) 



(30) Priority data: 

252,949 
359,543 
407,663 



3 October 1988 (03.10.88) US 
1 June 1989 (01.06.89) US 
19 September 1989(19.09.89) US 



(71) Applicant: REPLIGEN CORPORATION [US/US]; One 

Kendal! Square, Building 700, Cambridge, MA 02139 
(US). 

(72) Imrentors: RUSCHE, James, R. ; 18 Brigham Road, Fram- 

ingham, MA 01701 (US). PUTNEY, Scott, D. ; 5 Epping 
Street, Arlington, MA 02174 (US). JAVAHERIAN, 
Kashayar ; 27 Webster Road, Lexington, MA 02173 
(US). FARLEY, John ; 507 Reeves Road, Pittsford, NY 
14534 (US). GRIMAILA, Raymond ; 311 Washington 
Street, Somerville, MA 02134 (US). LYNN, Debra ; 1 
Watermiil Place, Unit 328, Arlington, MA 02174 (US). 
PETRO-BREYER, Joan ; Institut Pasteur, 



Department of Biotechnology, 25, rue du Docteur-Roux, F- 
75724 Paris Cedex 15 (FR). O'KEEFFE, Thomas ; 14 Karen 
Road, Windham, NH 03089 (US). LAROSA, Gregory ; 85 
Strathmore Road, Apt 33, Brookline, MA 02146 (US). PRO- 
FY, Albert, T. ; 28 Essex Street, Cambridge, MA 02139 (US). 

(74) Agents: SALIWANCHIK, Roman et al.; Saliwanchik & 
Saliwanchik, 2421 N.W. 41st Street, Gainesville, FL 
32606 (US). 

(81) Designated States: AT (European patent), AU, BE (Euro- 
pean patent), CH (European patent), DE (European pa- 
tent), DK, Fl, FR (European patent), GB (European pa- 
tent), IT (European patent), JP, KR, LU (European pa- 
tent), NL (European patent), NO, SE (European patent). 

Published 

With international search report 



(54) Title: HIV PROTEINS AND PEPTIDES USEFUL IN THE DIAGNOSIS, PROPHYLAXIS OR THERAPY OF AIDS 



(57) Abstract 
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DESCRIPTION 



HTV PROTEINS AND PEPTIDES USEFUL 
IN THE DIAGNOS IS. PROPHYLAXIS OR THERAPY OF AIDS 

5 

Cross-Reference to a Related Application 
This is a continuation-in-part of our co-pending application Serial No. 359,543, filed on June 
1, 1989, which is a continuation-in-part of our co-pending application Serial No. 252,949, filed on 
October 3, 1988, which is a continuation-in-part of our co-pending application Serial No. 090,080, filed 
10 on August 27, 1987. 



Background of the Invention 
Human immunodeficiency virus (HTV), human T-cell h/mphotropic virus 111 (HTLV-HI), 
rymphadenopathy-associated virus (LAV), or AIDS-associated retrovirus (ARV) has been identified 
15 as the cause of acquired immune deficiency syndrome (AIDS) (Popovic, M., M.G. Sarngadharan, E. 

Read, and R.G Gallo [1984] Science 224:497-500). The virus displays tropism for the OKT4+ 
lymphocyte subset (Klatzmann, D., F. Barre-Sinoussi, M.T. Nugeyre, C Dauget, E. Vilmer, C Griscelli, 
F. Brun-Vezinet, C Rouzioux, J.D. Gluckman, J.C Chermann, and L. Montagnier (1984) Science 
225:59-63). Antibodies against HIV proteins in the sera of most AIDS and AIDS related complex 
20 (ARC) patients, and in asymptomatic people infected with tbe virus (Sarngadharan, M.G., M. Popovic, 

L Bruch, J. Schupbach, and R.C Oallo [1984] Science 224:506-508) have made possible the 
development of immunologically based tests that detect antibodies to these antigens. These tests arc 
used to Umit the spread of HTV through blood transfusion by identifying blood samples of people 
infected with the virus. Diagnostic tests currently available commercially use ihe proteins of 
25 inactivated virus and antigens. 

In addition to allowing new approaches for diagnosis, recombinant DNA holds great promise 
for the development of vaccines against both bacteria and viruses (Wilson, T. [1984] Bio/Technology 
2:29-39). The most widely employed organisms to express recombinant vaccines have been E coli, S. 
ccrevisiac and cultured mammalian cells. For example, snbunit vaccines against foot and mouth 
disease (Kleid, D.G., D. Yansura, B. Small, D. Dowbenlco, D.M. Moore, MJ. Brubraan, P.D. 
McKercher, D.O. Morgan, B.R Robertson, and H.L Bachracb [1981] Science 214:1125-1129) and 
malaria (Young, J.F., W.T. Hockmeyer, M. Gross, W. Ripley-Ballou, R.A Winz, J.H. Trosper, RL 
Beaudoin, M.R. Hollingdale, LM. Mfller, CL Diggs, and M Rosenberg [1985] Science 228:958-962) 
have been synthesized tn E coli. Other examples are hepatitis B surface antigeu produced in yeasi 
(McAleer, WJ„ E.B. Buynak, R.Z. Maigetier, D.E. Wamplcr, WJ. Miller, and M.R. Hilleman |1984) 
Nature 307:178-180) and a herpes vaccine produced in mammalian cells fBerman, P.W., T. Gregory, 
D. Chase, and LA Lasky [1984] Science 227:1490-1492). 
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The entire HIV envelope or portions thereof have been used to immunize animals. The terms 
•protein," 'peptide/ and "polypeptide' have been used interchangeably in this application to refer 10 
chemical compounds having more than one amino acid. The term "compound" as used here refers to 
chemical compounds in general Thus, "compound" includes proteins, peptides, and polypeptides. Also 
5 included under the category of "compound" are fusion compounds where polypeptides are combined 

with non-poh/peptlde moieties. As used in the present application, the term "naturally occurring HIV 
envelope protein' refers to the proteins gpl60, gpl20, and gp41 only. As used in the present 
application, HIV refers to any HIV virus, including HIV-1 and HIV-2. 

Both the native gpl20 (Robey et ai [1986] Proa Natl. Acad. Sci. 83:7023-7027; Matthews et 
10 al. [1986] Proc. Natl. Acad. Sci. 83:9709-9713) and recombinant proteins (Laskey et al. [1986] Science 

233:209-212; Putney et al. [1986] Science 234:1392-1395) elicit antibodies that can neutralize HTV in 
cell culture. However, all of these immunogens elicit antibodies that neutralize only the viral variant 
from which the subunit was derived. Therefore, a novel vaccine capable of protecting against multiple 
viral variants would be advantageous and unique. 
15 HIV is known to undergo amino acid sequence variation, particularly in the envelope gene 

(Starcich, B.R. [1986] Cell 45:637-648; Hahn, BJI. et aL [1986] Science 232:1548-1553). Over 100 
variants have been analyzed by molecular cloning and restriction enzyme recognition analysis, and 
several of these have been analyzed by nucleotide sequencing. Some of these are the HIV isolates 
known as RF (Popovic, M et al. [1984] Science 224:497-500), WMJ-1 (Hahn, B.H. et al. [19861 
20 Science 232:1548-1553), LAV (Wain-Hobson, S. et aL [1985] Cell 40:9-17), and ARV-2 (Sanchez- 

Pescador, R. et al. [1985] Science 227:484-492). One aspect of this invention is defining the portion 
of HIV that comprises the principal neutralizing domain. The principal neutralizing domain is located 
between the cysteine residues at amino acids 296 and 331 of the HIV envelope. The numbering of 
amino acids follows the published sequence of HIV-in B (Ratner, L. et al. [1985] Nature 313:277- 
25 284). This domain is known to be hypervariable but retains the type-specific antigenic and 

immunogenic properties related to virus neutralization. 

A further aspect of the subject invention is the discovery of highly conserved amino acids 
within the principal neutralizing domain. Although certain sequences from this region have been 
published (see, for example, Southwest Foundation for Biological Research, published PCT application, 
Publication No. WO 87AE775; Genetic Systems Corporation, Published United Kingdom Application 
No. GB 2196634 A; Stichting Centraal Diergeneeskundig InsrJluut, Published EPO Application No. 
0 311 219), the presence of the conserved regions described here have never before been described. 

Diagnostic kits or therapeutic agents using viral proteins isolated from virus infected cells or 
recombinant proteins would contain epitopes specific to the viral variant from which they were 
isolated. Reagents containing proteins from multiple variants would have the utility of being more 
broadly reactive due to containing a greater diversity of epitopes. This would be advantageous in the 
screening of serum from patients or therapeutic treatment of patients. 
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25 



35 



Synthetic peptides can be advantageous as the active ingredient in a vaccine, therapeutic agent 
or diagnostic reagent due to the ease of manufacture and ability to modify their structure and mode 
of presentation. 

Synthetic peptides have been used successfully in vaccination against foot and mouth disease 
virus (Bittle et al. [1982] Nature 29830-33); poUovirus (Rmini et aL [1983] Nature 305:699); hepatitis 
B (Gerin et aL [1983] Proc NatL Acad. Sd. 802365-2369); and influenza (Shapira et al. [1984] Proc 
NatL Acad. Sci. 812461-2465). 

There is a real need at this time to develop a vaccine for AIDS. Such a vaccine, 
advantageously, would be effective to immunize a host against the variant AIDS viruses. 



Brief Summary of the Invention 
The subject invention defines the location of the HIV principal neutralizing domain and 
discloses methods to utilize this segment of the HIV envelope protein for developing diagnostic, 
therapeutic, and prophylactic reagents. More specifically, the HIV principal neutralizing domain is 
15 located between cysteine residues 296 and 331 of the HTV envelope protein. The location of this 

domain is shown in Table 1. Although the specific amino add sequence of the principal neutralizing 
domain is known to be highly variable between variants, we have found that peptides from this domain 
are invariably capable of raising, and/or binding with, neutralizing antibodies. This unexpected 
discovery provides a basis for designing compositions and strategies for the prevention, diagnosis, and 
20 treatment of AIDS. 

The discovery of the principal neutralizing domain (also known as the "loop") resulted from 
extensive research involving a multitude of HTV envelope proteins and peptides from many HIV 
variants. Proteins and peptides capable of raising, and/or binding with, neutralizing antibodies are 
disclosed here. These novel HTV proteins and peptides, or their equivalents, can be used in the 
diagnosis, prophylaxis, and/or therapy of AIDS. Further, the peptides can be used as immunogens or 
screening reagents to generate or identify polyclonal and monoclonal antibodies that would be useful 
in prophylaxis, diagnosis and therapy of HTV infection. 

A farther aspect of the invention is the discovery of highly conserved regions within the 
principal neutralizing domain. This discovery was quite unexpected because of the known variability 
30 of the amino acids within this segment of the HTV envelope protein. 

The proteins and peptides of the invention are identified herein by both their amino acid 
sequences and the DNA encoding them. Thus, they can be prepared by known chemical synthetic 
procedures, or by recombinant DNA means. 

These peptides, or peptides having the antigenic or immunogenic properties of these peptides, 
can be used, advantageously, in a vaccine, eg., a cocktail of peptides, to eudl broad neutralizing 
antibodies in the host. Further, these peptides can be used sequentially, e,g^ immunizing initially with 
a peptide equivalent to the prindpal neutralizing domain of an HIV variety followed by immunization 
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with one or more of the above peptides. Polyclonal or monoclonal antibodies that bind to these 
peptides would be advantageous in prophylaxis or therapy against HIV, the causative agent of AIDS. 

Brief Description of the Drawings 
5 Figure 1 shows commonly occurring sequences of the principal neutralizing domain. 

Figure 2 is a schematic for multi-epitope gene construction. 
Figure 3 depicts the steps in the construction of a specific multi-epitope gene. 
Figure 4 shows the sequences of four synthesized single-stranded oligomers for construction 
of a multi-epitope gene. 

10 

Detailed Disclosure of the Invention 
Described here is a segment of the HIV envelope protein which raises, and/or binds with, 

neutralizing antibodies. This unique and highly unexpected property has been observed in each HIV 

variant that has been examined. The segment of interest has been named the "principal neutralizing 
15 domain." The principal neutralizing domain is bounded by cysteine residues which occur at positions 

296 and 331. It should be noted that these same cysteine residues have been described as beginning 

at 302, rather than 296 (Rusche, J.R. et aL [1988] Proc. Natl Acad. Sci. USA 85(15)3198-3202). 

Because the cysteine residues are linked through disulfide bonds, the segment between the residues 

tends to form a loop. Therefore, the principal neutralizing domain is also referred to as the "loop." 
20 The segment of the protein envelope identified here as the principal neutralizing domain is 

known to be highly variable between HIV variants. Thus it is surprising that, for each variant, this 

segment is capable of eliciting, and/or binding with, neutralizing antibodies. 

The principal neutralizing domain identified here is a small segment of the HIV envelope 

protein. This small segment may be combined with additional amino acids, if desired, for a specific 
25 purpose. All such proteins are claimed here except where such proteins constitute a naturally 

occurring HTV envelope protein. As used here, the term "naturally occurring envelope protein" refers 

only to gpl60, gpl20, and gp41. 

listed in Table 1 are sequences of the principal neutralizing domain for some of the variants 

tested. Table 9 contains a complete list of the principal neutralizing domains. 
30 Amino acids may be referred to using either a three-letter or one-letter abbreviation system. 

The following is a list of the common amino acids and their abbreviations: 
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Amino add 


Three-letter symbol 


One-letter 


Alanine 


Ala 


A 


Arginine 


Arg 


R 


Asparagine 


Asn 


N 


Aspanic acid 


Asp 


D 


Asn and/or Asp 


Aax 


B 


Cysteine 


Cys 


C 


Glutamine 


Gin 


Q 


Glutamic acid 


Glu 


E 


Gin and/or Glu 


Gbt 


Z 


Glycine 


Gly 


G 


Histidine 


His 


H 


Isoleucine 


He 


1 


Leucine 


Leu 


L 


Lysine 


Lys 


K 


Methionine 


Met 


M 


Phenylalanine 


Phe 


F 


Proline 


Pro 


P 


Serine 


Ser 


S 


Threonine 


Thr 


T 


Tryptophan 


Trp 


W 


Tyrosine 


Tyr 


Y 


Valine 


Val 


V 



The following is a list of proteins and peptides which comprise principal neutralizing domains 
or segments thereof. 

A. Recombinant Proteins Comprising a Principal Neutralizing Domain 

1. HIV 10 Kd fusion protein denoted Sub 1. The amino acid sequence of the HIV 
portion of Sub 1 is shown in Tabic 2 and the DNA sequence of the HIV portion of 
Sub 1 in Table 2A. The amino acid sequence of Sub 1 is shown in Tabic 2B and the 
DNA sequence in Table 2C 
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2. HIV 18 Kd fusion protein denoted Sob 2. The amino add sequence of the HIV 
portion of Sub 2 is shown in Table 3 and the DNA sequence of the HIV portion of 
Sub 2 in Table 3A The entire amino acid sequence of Sub 2 is shown in Table 3B 
and the entire DNA sequence in Table 3C 
5 3. HIV 27 Kd fusion protein denoted PBIrp The amino add sequence of the HIV 

portion of PBIrf is listed in Table 4 and the DNA sequence of the HIV portion of 
PBIrp is listed in Table 4A The entire amino acid sequence and DNA sequence of 
PBIrp are in Tables 4B and 4C, respectively. 

4. HIV 28 Kd fusion protein denoted PBImn. The amino acid sequence of the HIV 
10 portion of PBImn is shown in Table 5 and the DNA sequence of the HIV portion 

of PBImn is shown in Table 5A The entire amino acid sequence and DNA sequence 
of PBImn are shown in Tables 5B and 5C, respectively. 

5. HIV 26 Kd fusion protein denoted PB1 SC . The amino acid sequence of the HIV 
portion of PB1 SC is listed in Table 6 and the DNA sequence of the HIV portion of 

IS PB1 SC is shown in Table 6A The entire amino acid sequence and DNA sequence 

of PB1 SC are shown in Tables 6B and 6C, respectively. 

6. HIV 26 Kd fusion protein denoted PBlw&o2- The amino acid sequence of the HIV 
portion of PBlwxm is listed in Table 7 and the DNA sequence of the HTV portion 
of PB1 WMJ2 is shown in Table 7A The entire amino acid sequence and DNA 

20 sequence of PBIwmk arc shown in Tables 7B and 7C, respectively. 



B. Synthetic Peptides Comprising Segments or the Principal Neutralizing Domain From IITV 
Variants 

The amino acid cysteine in parentheses is added for the purpose of crosslinking lo carrier 
25 proteins. Also, where the peptides have cysteines at or near both ends, these cysteines can form a 

disulfide bond, thus giving the peptides a loop-like configuration. For any of these peptides which do 
not have cysteines at or near both ends, cysteines may be added if a loop-like configuration is desired. 
The loop configuration can be utilized to enhance the immunogenic properties of the peptides. Other 
amino acids in parentheses are immunologically silent spacers. 

30 

Peptide 135 (from isolate HIV-ni B ): 
Asn Asn Thr Arg Lys Ser He Arg lie Gin Arg Gly 
Pro Gly Arg Ala Phe Val Thr He Gly Lys He Gry 
(Cys) 



35 
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Peptide 136 (from isolate HTV-IIIB): 
Leu Asn Gin Scr Val Glu De 
Asn Asn Thr Arg Lys Ser lie 
Gry Arg Ala Phe Val Thr De 

5 

Peptide 139 (from isolate HIV-RF): 
Asn Asn Thr Arg Lys Ser lie 
Arg Val Ue Tyr Ala Thr Gly 



7 

Asn Cys Thr Arg Pro Asn 

Arg lie Gin Arg Gry Pro 

Gly Lys De Gry Asn Met 

Thr Lys Gry Pro Gry 

Gin De De Gry (Cys) 



10 Peptide 141 (from isolate HIV-WMJ2): 

Asn Asn Val Arg Arg Ser Leu Ser lie Gly Pro Gly 
Arg Ala Phe Arg Thr Arg Glu De De Gly (Cys) 



Peptide 142 (from isolate HTV-MN): 
15 Tyr Asn Lys Arg Lys Arg He His De Gry Pro Gly 

Arg Ala Phe Tyr Thr Thr Lys Asn De De Gry 

(Cys) 



Peptide 143 (from isolate HIV-SC): 
20 Asn Asn Thr Thr Arg Ser De His De Gry Pro Gly 

Arg Ala Phe Tyr Ala Thr Gry Asp He He Gry 

(Cys) 



Peptide 131 (from isolate HTV-IIIB) : 
(Tyr) Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser De 
Arg Ue Gin Arg Gly 



Peptide 132 (from isolate HTV-niB): 
Pro Gly Arg Ala Phe Val Thr He Gry Lys lie Gly Asn 
Met Arg Gin Ala His Cys (Tyr) 



35 



Peptide 134 (from isolate HIV-inB): 
Glu Arg Val Ala Asp Leu Asn Gin 
Cys Thr Arg Pro Asn Asn Asn Thr 



Ser 
Arg 



Val Glu He Asn 
Lys Ser lie 
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Peptide 339 (from isolate HTV-RF): 
He Thr Lys Gly Pro Gly 

RP341 (from isolate HIV-WMJ2): 
5 Leu Ser lie Gly Pro Gly 

RP343 (from isolate HTV-SQ: 
He His lie Gly Pro Gly 

10 RP60 (from isolate HTV-HIB): 

He Asn Cys Thr Arg Pro 
He 

RP33S (from isolate HIV-IIIB): 
He Gin Arg Gly Pro Gly 

RP337 (from isolate HIV-IIIB): 
Lys Ser He Arg He Gin 
(Cys) 

RP77 (from isolate HTV-IIIB): 
Gly Pro Gly Arg Ala Phe 

RP83 (from isolate HTV-WMJ1): 
His He His He Gly Pro 

(cys) 

RP79 (from isolate HIV-TUB): 
Gin Arg Gly Pro Gly Arg 

RP57 : 

He Asn Cys Thr Arg Pro 
RP55: 

Ala His Cys Asn He Ser 



8 

Arg Val He Tyr (Cys) 

Arg Ala Phe Arg (Cys) 

Arg Ala Phe Tyr (Cys) 

Asn Asn Asn Thr Arg Lys Ser 

Arg Ala Phe (Cys) 

Arg Gly Pro Gly Arg Ala Phe 

Gly Arg Ala Phe Tyr Thr Gly 

Ala Phe (Cys) 

Ala His Cys Asn lie Ser 
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RP75A: 

(Ala Ala Ala Ala Ala Ala) Gly Pro Oly Arg (Ala 
Ala Ala Ala Ala Cys) 

5 

RP56: 

He Asn Cys Thr Arg Pro 
RP59: 

10 He Gly Asp lie Arg Gin Ala His Cys Asn lie Ser 

RP342 (from isolate HIV-MN): 
lie His He Gry Pro Gly Arg Ala Phe Tyr Thr (Cys) 

15 RP96 (HIV-MN related): 

(Cys) Gly lie His lie Gry Pro Gly Arg Ala Phe Tyr Thr 
(Cys) 

RP97 (HIV-MN related): 
20 (Ser Gly Gry) He His lie Gry Pro Gry Arg Ala Phe Tyr (Gly 

Gry Ser Cys) 

RP98 (HIV-MN related): 
(Cys Ser Gly Oly) De His lie Gly Pro Gly Arg Ala Phe Tyr 
25 (Gry Gly Ser Cys) 

RP99 (HIV-MN related): 

(Cys Ser Gry Gly) He His He Gry Pro Gry Arg Ala Phe Tyr 

(Gly Gry Ser) 

30 

RP100 : 

(Ser Gry Gry) Thr Arg Lys Gly He His He Gry Pro Gly Arg 

Ala He Tyr (Gry Gly Ser Cys) 
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RP1Q2 : 

(Scr Oly Gly) Thr Arg Lys Ser lie Ser He Gly Pro Gly Arg 
Ala Phe (Gly Gly Ser Cys) 

5 RP91 (MN-Hepatitis fusion): 

Arg He His He Gly Pro Gly Arg Ala Phe Tyr Gly Phe Phe 
Leu Leu Thi Arg De Leu Thr lie Pro Gin Ser Leu Asp (Cys) 

RP104 : 

0 (Ser Gly Gly) lie Gly Pro Gly Arg Ala Phe Tyr Thr Thr Lys 

(Gly Gly Ser Cys) 

RP106 : 

(Ser Gly Gly) Arg De His He Gly Pro Gly Arg Ala Phe (Gly 
5 Gly Ser Cys) 

RP108 : 

(Ser Gly Gly) His lie Gly Pro Gly Arg Ala Phe Tyr Ala Thr 
Gly (Gly Gry Ser Cys) 

0 

RP7Q (from isolate HJV-MN): 
He Asn Cys Thr Arg Pro Asn TyT Asn Lys Arg Lys Arg He 
His De Gly Pro Gly Arg Ala Phe Tyr Thr Thr Lys Asn He 
He Gly Thr He Arg Gin Ala His Cys Asn He Ser 

5 

RP84 (from isolate HIV-MN): 
He His He Gly Pro Gly Arg Ala Phe Tyr Thr Gly (Cys) 

RP144 (from isolate 7887-3): 
0 Asn Asn Thr Ser Arg Gly He Arg He Gly Pro Gly Arg Ala He 

Leu Ala Thr Glu Arg He He Gly (Cys) 

RP145 (from isolate 6587-7): 
Asn Asn Thr Arg Lys Gly He His lie Gly Pro Gry Arg Ala 
5 Phe Tyr Ala Thr Gly Asp He lie Gly (Cys) 



WO 90/03984 



PCI7US89/04302 



11 



RP146 (from isolate CC); 

Asn Asn Thr Lys Lys Gly Be Arg lie Gly Pro Gly Arg Ala 

Val Tyr Thr Ala Arg Arg He lie Gly (Cys) 

5 

RP147 (from isolate KK261): 

Asn Asn Thr Arg Lys Gly He Tyr Val Oly Ser Gly Arg Lys 

Val Tyr Thr Arg His Lys He lie Gly (Cys) 

10 RP150 (from isolate ARV-2): 

Asn Asn Thr Arg Lys Ser lie Tyr He Gly Pro Gly Arg Ala 
Phe His Thr Thr Gly Arg lie lie Gly (Cys) 

RP151 (from isolate NY5): 
15 Asn Asn Thr Lys Lys Gly He Ala lie Gly Pro Gly Arg Thr 

Leu Tyr Ala Arg Glu Lys De He Gly (Cys) 



C Hybrid Peptides Containing Sequences from More Than One Variant 

RP73 (from isolates HTV-IIIB, HTV-RF): 
20 Lys Ser lie Arg He Gin Arg Gly Pro Gly Arg Val He 

Tyr (Cys) 



RP74 (from isolates HTV-IIIB, HJV-RF, HTV-MN, HIV-SC): 
Arg He His He Gly Pro Gly Arg Ala Phe Tyr Ala Lys 
25 Ser He Arg He Gin Arg Gly Pro Gry Arg Val He Tyr 

(Cys) 



RP80 (from isolates HTV-IIIB, HTV-RF): 
Arg He Gin Arg Gly Pro Gry Arg Val He Tyr Ala Thr 
30 (Cys) 



RP81 (from isolates HTV-IIJB, HTV-RF, HTV-WMJ1, HTV-MN): 
Arg He His He Gly Pro Gly Arg Ala Phe Tyr Thr Gry 
Arg He Gin Arg Gly Pro Gly Arg Val He Tyr Ala Thr 
35 (Cys) 
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RP82 (from isolates HIV-MN, HIV-WMJ1): 
Arg He His He Oly Pro Gly Arg Ala Phe Tyr Thr Gly 
(Cys) 

5 

RP88 (from isolates HIV-MN, HTV-SC): 
Ser lie His Le Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 
(Cys) 

10 RP137 (from isolates HIV-IIIB, HTV-RF): 

Asn Asn Thr Arg Lys Ser He Arg Be Thr Lys Gly Pro 
Gly Arg Ala Phe Val Thr lie Gly Lys He Gly (Cys) 

RP140 (from isolates HIV-IIIB, HIV-RF): 
15 Asn Asn Thr Arg Lys Ser lie Thr Lys Gly Pro Gly Arg 

Ala Phe Val Thr lie Gly Lys He Gly (Cys) 

Peptide 64 (from isolates HIV-IIIB, HTV-RF, HIV-MN, HIV-SC): 
Arg He His He Gly Pro Gly Arg Ala He Phe Tyr Arg 
20 He Gin Arg Gly Pro Gly Arg Val He Tyr (Cys) 

Peptide 338 (from isolates HTV-HIB, HTV-RF): 
Arg He Gin Arg Gly Pro Gry Arg Val He Tyr (Cys) 

25 Peptide 138 (from isolates HIV-IIIB, HTV-RF): 

Asn Asn Thr Arg Lys Ser He Arg He Gin Arg Gry 
Pro Gly Arg Val He Tyr Ala Thr Gly Lys He Gry 
(Cys) 

30 RP63 (IIIg-RF hybrid): 

Arg He Gin Arg Gly Pro Gly Arg Val He Tyr (Cys) 

D. Miscellaneous Peptide Sequences 

RP41: 

35 Gly pro Gly Arg 
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RP61 : 

Gry Pro Gly Arg (Ala Ala Ala Ala Ala Ala Cys) 
5 RP75 : 

(Cys Ala Ala Ala Ala Ala) Gly Pro Gly Arg Ala Phe (Ala Ala 
Ala Cys) 

RP111 : 

ID He Gin Arg Gly Pro Gly lie Gin Arg Gly Pro Gry (Cys) 

RP113 : 

Gin Arg Gry Pro Gry Arg Gin Arg Gly Pro Gry Arg Gin Arg 
Gly Pro Gry Arg (Cys) 

15 

RP114 : 

Arg Gly Pro Gly Arg Gry Pro Gry Arg Gry Pro Gly Arg Gry 
Pro Gry Arg (Cys) 

20 RP116 : 

Gly Pro Gry Arg Ala Phe Gly Pro Gly Arg Ala Phe Gly Pro 
Gly Arg Ala Phe (Cys) 



25 



RP120 : 

Ser He Arg De Gry Pro Gry Arg Ala Phe Tyr Thr (Cys) 
RP121C 

(Cys) Gly Pro Gly Arg (Cys) 



30 RP122c : 

(Cys) He Gly Pro Gry Arg Ala (Cys) 

RP123c: 

(Cys) His lie Gry Pro Gly Arg AJa Phe (Cys) 

35 
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The proteins and peptides exemplifying the subject invention can be made by well-known 
synthesis procedures. Alternatively, these entities can be made by use of recombinant DNA 
procedures. Such recombinant DNA procedures are disclosed herein since they were, in tact, the 
procedures initially utilized to obtain the novel proteins and peptides of the invention. However, once 
5 these entities were prepared and their molecules sequenced, it is apparent to a person skilled in the 

art that the preferred method for making them would now be by chemical synthesis means. For 
example, there are available automated machines which can readily make proteins and peptides of the 
molecular sizes disclosed herein. 

In the recombinant DNA procedures for making some of the proteins and peptides of the 
10 invention, an expression vector plasmid denoted pREV22 was used. This plasmid was initially 

constructed from a plasmid denoted pBGl. 

Plasmid pBGl is deposited in the H. coli host MS371 with the Northern Regional Research 
Laboratory (NRRL, U.S. Department of Agriculture, Peoria, Illinois, USA). It is in the permanent 
collection of this repository. R coli MS371(pBGl), NRR1 B-15904, was deposited on November 1, 
15 1984. E. coy MS371, NRRL B-15129 is now available to the public. 

Plasmid pREV2^ was deposited in the E. coli JM103 host with NRRL on July 30, 1986. E. 
coli JM103(pREV2.2) received the accession number NRRL B-18091. NRRL B-15904 and NRRL 
B- 18091 will be available, without restrictions, to the public upon the grant of a patent which discloses 
them. 

20 Other E. coli strains, disclosed herein, were deposited as follows: 

E. coli SG20251, NRRL B-15918, was deposited on December 12, 1984. 
E. coli CAG629(pKHl), NRRL B-18095, was deposited on July 30, 1986. 
This latter deposit can be subjected to standard techniques to separate the plasmid from the 
host eel), and, thus, use the host g. coli GAG629 as disclosed herein. 
25 The subject cultures have been deposited under conditions that assure that access to the 

cultures will be available during the pendency of the patent application disclosing them to one 
determined by the Commissioner of Patents and Trademarks to be entitled thereto under 37 CFR 1.14 
and 35 USC 122, The deposits arc available as required by foreign patent laws in countries wherein 
counterparts of the subject application, or its progeny, are filed. However, it should be understood 
30 that the availability of a deposit does not constitute a license to practice the subject invention in 

derogation of patent rights granted by governmental action. 

Further, the subject culture deposits will be stored and made available lo the public in accord 
with the provisions of the Budapest Treaty for the Deposit of Microorganisms, Le^ they will be stored 
with all the care necessary to keep them viable and uncontaminated for a period of at least five years 
35 after the most recent request for the furnishing of a sample of the deposits, and in any case, for a 
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period of at least 30 (thirty) years after the date of deposit or for the enforceable life of any patent 
which may issue disclosing the cultures. The depositor acknowledges the duty to replace the deposits 
should the depository be unable to furnish a sample when requested, due to the condition of the 
deposits. All restrictions on the availability to the public of the subject culture deposits will be 
5 irrevocably removed upon the granting of a patent disclosing them. 

The novel HIV proteins and peptides of the subject invention can be expressed in 
Saccharomvces cercvisiae using plasmids containing the Inducible galactose promoter from this 
organism (Broach, J.R., Y. Li, L.C Wu, and M. Jayaram [1983] in Experimental Manipulation of Gene 
Expression, p. 83, ed. M Inouye, Academic Press). These plasmids are called YEp51 and YEp52 
10 (Broach, J.R. ei a) [1983]) and contain the E^goji origin of replication, the gene for ^-lactamase, the 

yeast LEU2 gene, the 2 jim origin of replication and the 2^m drcJe REP3 locus. Recombinant gene 
expression is driven by the yeast GAL10 gene promoter. 

Yeast promoters such as galactose and alcohol dehydrogenase (Bennet2en, J.L. and B,D. Hall 
[1982] J. BioL Chem. 2573018; Ammerer, G. [1983] in Methods in En2ymology Vol. 101, p. 192), 
15 phosphoglycerate kinase (Derynck, R., Rj\. Hitzeman, P.W. Oray, D.V. Goeddel [1983] in 

Experimental Manipulation of Gene Expression, p. 247, ed. M. Inouye, Academic Press), triose 
phosphate isomerase (Alber, T. and G. Kawasaki [1982]* J. Molec and Applied GeneL 1:419), or 
enolasc (Innes, MA et al. [1985] Science 226:2 1) can be used. 

The genes disclosed herein can be expressed in simian cells. When the genes encoding these 
20 proteins are cloned into one of the plasmids as described in Okayama and Berg (Okayama, R and 

P. Berg [1983] Molec and CelL BioL 3:280) and references therein, or COS cells transformed with 
these plasmids, synthesis of HTV proteins can be detected immunological ry. 

Other mammalian cell gene expression/protein production systems can be used. Examples of 
other such systems are the vaccinia virus expression system (Moss, B. [1985] Immunology Today 6243; 
25 Chakrabarti, S„ K. Brechling, B. Moss [1985] Molec. and CelL BioL 53403) and the vectors derived 

from murine retroviruses (Mulligan, R.C [1983] in Experimental Manipulation of Gene Expression, 
p. 155, ed. M. Inouye, Academic Press). 

The HIV proteins and peptides of the subject invention can be chemically synthesized by solid 
phase peptide synthetic techniques such as BOC and FMOC (Merrifield, R.B. [1963] J. Amcr. Chem. 
30 Soc. 85:2149; Chang, C and J, Meienhofer [1978] Int. J. Peptide Protein Res. 11:246). 

As is well known in the art, the amino add sequence of a protein is determined by the 
nucleotide sequence of the DN A Because of the redundancy of the genetic code, It, more than one 
coding nucleotide triplet (codon) can be used for most of the amino adds used to make proteins, 
different nucleotide sequences can code for a particular amino add. Thus, the genetic code can be 
35 depicted as follows: 
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Phenylalanine (Pbe) 


TTK 


Histidrae (His) 


CAK 


Leucine (Leu) 


XTY 


Olutamine (Gin) 


CAJ 


Isoleucine (He) 


ATM 


Asparagine (Asn) 


AAK 


Methionine (Met) 


ATG 


Lysine (Lys) 


AAJ 


Valine (Val) 


GTL 


As partic acid (Asp) 


GAK 


Serine (Ser) 


QRS 


Glutamic acid (Glu) 


GAJ 


Proline (Pro) 


CCL 


Cysteine (Cys) 


TGK 


Threonine (Thr) 


ACL 


Tryptophan (Trp) 


TGG 


Alanine (Ala) 


GCL 


Argininc (Arg) 


WGZ 


Tyrosine (Tyr) 


TAK 


Glycine (Gly) 


GGL 


Termination signal 


TAJ 






Termination signal 


TOA 







Key: Each 3-Ietter deoxynucleotide triplet corresponds to a trinucleotide of mRNA, having a 5 '-end 
on the left and a 3'-end on the right All DNA sequences given herein are those of the strand whose 
15 sequence corresponds to the mRNA sequence, with thymine substituted for uracil. The letters stand 

for the purine or pyrimidine bases forming the deoxynucleotide sequence. 

A — adenine 

G a guanine 

C = cytosine 
20 T «= thymine 

X^TorCifYisAorG 

X - CifYisCorT 

Y = A, G, C or T if X is C 

Y = AorGifXisT 

25 W = C or A if Z is A or G 

W«=CifZisCorT 

Z = A, G, C or T if W is C 

Z^AorGifWisA 

QR «= TC if S is A, G, C or T; alternatively 
30 QR = AG if S is T or C 

J = A or G 

K = T or C 

L — A, T, C or G 

M = A, C or T 



35 
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The above shows that the novel amino acid sequences of the HIV proteins and peptides of 
the subject invention can be prepared by nucleotide sequences other than those disclosed herein. 
Functionally equivalent nucleotide sequences encoding the novel amino acid sequences of these HIV 
proteins and peptides, or fragments thereof having HIV antigenic or immunogenic or therapeutic 
5 activity, can be prepared by known synthetic procedures. Accordingly, the subject invention includes 

such functionally equivalent nucleotide sequences. 

Thus the scope of the subject invention includes not only the specific nucleotide sequences 
depicted herein, but also all equivalent nucleotide sequences coding for molecules with substantially 
the same HIV antigenic or immunogenic or therapeutic activity. 
10 Further, the scope of the subject invention is intended to cover not only (he specific amino 

acid sequences disclosed, but also similar sequences coding for proteins or protein fragments having 
comparable ability to induce the formation of and/or bind to specific HTV antibodies possessing the 
properties of virus neutralization. 

The term "equivalent" is being used in its ordinary patent usage here as denoting a nucleotide 
15 sequence which performs substantially as the nucleotide sequence identified herein to produce 

molecules with substantially the same HTV antigenic or immunogenic or therapeutic activity in 
essentially the same land of hosts. Within this definition are subfragments which have HIV antigenic 
or immunogenic or therapeutic activity. 

As disclosed above, it is well within the skill of those in the genetic engineering art to use the 
20 nucleotide sequences encoding HTV antigenic or immunogenic or therapeutic activity of the subject 

invention to produce HIV proteins via microbial processes. Fusing the sequences into an expression 
vector and transforming or transfecting into hosts, either eukaryotic (yeast or mammalian cells) or 
prokaryotic (bacterial cells), are standard procedures used in producing other well-known proteins, eg., 
insulin, interferons, human growth hormone, IL-1, IL-2, and the like. Similar procedures, or obvious 
25 modifications thereof, can be employed to prepare HTV proteins or peptides by microbial means or 

tissue-culture technology in accord with the subject invention. 

The nucleotide sequences disclosed herein can be prepared by a "gene machine" by procedures 
well known in the art. This is possible because of the disclosure of the nucleotide sequence. 

The restriction enzymes disclosed can be purchased from Bethesda Research Laboratories, 
Gaithersburg, MD, or New England Biolabs, Beverly, MA The enzymes are used according to the 
instructions provided by the supplier. 

The various methods employed in the preparation of the plasmids and transformation of hosi 
organisms are well known in the art These procedures are all described in Maniatis, T., E.F. Fritsch. 
and J. Sambrook (1982) Molecular Cloning A Labor atory Manual. Cold Spring Harbor Laboraiory, 
New York. Tnus, it is within the skill of those in the genetic engineering an to extract DNA from 
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microbial cells, perform restriction enzyme digestions, electropborese DNA fragments, tail and anneal 
plasmid and insert DNA, ligate DNA, transform cells, eg., |L cpJU cells, prepare plasmid DNA, 
electropborese proteins, and sequence DNA. 

Immunochemical assays employing the HIV proteins or peptides of the invention can take a 
5 variety of forms. One preferred type is a liquid phase assay wherein the HIV antigen and the sample 

to be tested are mixed and allowed to form immune complexes in solution which are then detected 
by a variety of methods. Another preferred type is a solid phase immunometric assay. In solid phase 
assays, an HIV protein or peptide is immobilized on a solid phase to form an antigen- 
immunoadsorbent. The immunoadsorbent is incubated with the sample to be tested. After an 
10 appropriate incubation period, the immunoadsorbent is separated from the sample, and labeled anti- 

(human IgG) antibody is used to detect human anti-HTV antibody bound to the immunoadsorbent. 
The amount of label associated with the immunoadsorbent can be compared to positive and negative 
controls to assess the presence or absence of anti-HIV antibody. 

The immunoadsorbent can be prepared by adsorbing or coupling a purified HIV protein or 
15 peptide to a solid phase. Various solid phases can be used, such as beads formed of glass, polystyrene, 

polypropylene, dextran or other material Other suitable solid phases include tubes -or plates formed 
from or coated with these materials. 

The HTV proteins or peptides can be either covalently or non-covalently bound to the solid 
phase by techniques such as covalent bonding via an amide or ester linkage or adsorption. After the 
20 HIV protein or peptide is affixed to the solid phase, the solid phase can be post-coated with an animal 

protein, eg., 3% fish gelatin. This provides a blocking protein which reduces nonspecific adsorption 
of protein to the immunoadsorbent surface. 

The immunoadsorbent is then incubated with the sample to be tested for anti-HIV antibody. 
In blood screening, blood plasma or scrum is used. The plasma or serum is diluted with normal 
25 animal plasma or serum. The diluent plasma or serum is derived from the same animal species that 

is the source of the anti-(human IgG) antibody. The preferred anti-(human IgG) antibody is goat 
anti-(human IgG) antibody. Thus, in the preferred format, the diluent would be goat serum or plasma. 

The conditions of incubation, eg., pH and temperature, and the duration of incubation are 
not crucial. These parameters can be optimized by routine experimentation. Generally, the incubation 
will be run for 1-2 hr at about 45°C in a buffer of pH 7-8. 

After incubation, the immunoadsorbeni and the sample are separated. Separation can be 
accomplished by any conventional separation technique such as sedimentation or cenirifugaiion. The 
immunoadsorbent then may be washed free of sample to eliminate any interfering substance. 

The immunoadsorbent is incubated with the labeled anli-(human IgG) antibody (tracer) to 
detect human antibody bound thereto. Generally the immunoadsorbent is incubated with a solution 



30 



35 



WO 90/03984 



PCI7US89/04302 



19 

of the labeled anti-(human IgO) antibody which contains a small amount (about 1%) of the serum 
or plasma of the animal species which serves as the source of the an ti -(human IgO) antibody. Anti- 
(human IgG) antibody can be obtained from any animal source. However, goat anti-(human IgO) 
antibody is preferred. The anti-(human IgO) antibody can be an antibody against the Fc fragment of 
5 human IgG, for example, goat anti-(human IgG) Fc antibody. 

The anti-(human IgO) antibody or anti-(human IgO) Fc can be labeled with a radioactive 
material such as 125 1; labeled with an optical label, such as a fluorescent material; or labeled with an 
enzyme such as horseradish peroxidase. The anti-human antibody can also be biotinylated and labeled 
avidin used to detect its binding to the immunoadsorbent 

10 After incubation with the labeled antibody, the immunoadsorbent is separated from the 

solution and the label associated with the immunoadsorbent is evaluated. Depending upon the choice 
of label, the evaluation can be done in a variety of ways. The label may be detected by a gamma 
counter if the label is a radioactive gamma emitter, or by a fluorimeter, if the label is a fluorescent 
material. In the case of an enzyme, label detection may be done colorimetrically employing a substrate 

15 for the enzyme. 

The amount of label associated with the immunoadsorbent is compared with positive and 
negative controls in order to determine the presence of anti-HTV antibody. The controls are generally 
run concomitantly with the sample to be tested. A positive control is a serum containing antibody 
against HTV; a negative control is a serum from healthy individuals which does not contain antibody 

20 against HTV. 

For convenience and standardization, reagents for the performance of the immunometric assay 
can be assembled in assay kits. A kit for screening blood, for example, can include: 

(a) an immunoadsorbent, eg., a polystyrene bead coated with an HIV protein or peptide; 

(b) a diluent for the serum or plasma sample, eg. normal goat serum or plasma; 

25 ( C ) an anti-(human IgO) antibody, tg^ goat an ti -(human IgG) antibody in buffered, 

aqueous solution containing about 1% goat serum or plasma; 

(d) a positive control, &g^ serum containing antibody against at least one of the novel 
HTV proteins or peptides; and 

(e) a negative control, eg., pooled sera from healthy individuals which does not con urn 
3° antibody against at least one of the novel HIV proteins or peptides. 

If the label is an enzyme, an additional element of the kit can be the substrate for the enzyme. 
Another type of assay for anti-HIV antibody is an antigen sandwich assay. In this assay, a 
labeled HJV protein or peptide is used in place of anti-(human IgO) antibody to detect anti-HIV 
antibody bound to the immunoadsorbenL The assay is based in principle on the Mvalency of antibody 
35 molecules. One binding site of the antibody binds the antigen affixed to the solid phase; the second 
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is available for binding the labeled antigen. The assay procedure is essentially the same as described 
for the immunometric assay except that after incubation with the sample, the immunoadsorbent is 
incubated with a solution of labeled HTV protein or peptide. HIV proteins or peptides can be labeled 
with radioisotope, an enzyme, etc. for this type of assay. 
5 In a third format, the bacterial protein, protein A, which binds the Fc segment of an IgG 

molecule without interfering with the antigen-antibody interaction can be used as the labeled tracer 
to detect ami -HIV antibody adsorbed to the immunoadsorbent Protein A can be readily labeled with 
a radioisotope, enzyme, or other detectable species. 

Immunochemical assays employing an HIV protein or peptide have several advantages over 
10 those employing a whole (or disrupted) virus. Assays based upon an HIV protein or peptide will 

alleviate the concern over growing large quantities of infectious virus and the inherent variability 
associated with cell culturing and virus production. Further, the assay will help mitigate the real or 
perceived fear of contracting AIDS by technicians in hospitals, clinics and blood banks who perform 
the test 

15 Immunochemical assays employing recombinant envelope proteins from multiple viral variants 

have additional advantages over proteins from a single HTV variant Assays incorporating protein 
sequences from multiple variants are more likely to accurately survey antibodies in the human 
population infected with diverse HTV variants. Also, solid phase enzyme-linked immunosorbent assay 
(ELISA) utilizing different HTV variant proteins would allow determination of prevalent serotypes in 

20 different geographic locations. This determination has not been possible until now as no available 

antibody detection kit utilizes more than one HTV variant 

Another use of recombinant proteins from HTV variants is to elicit variant-specific antisera 
in test animals. This antiserum would provide a reagent to identify which viral variant infected an 
individual Currently, "virus typing" can only be done by viral gene cloning and sequencing. Binding 

25 of variant-specific serum to a patient viral isolate would provide a means of rapid detection not 

currently available. For example, sera raised to the proteins denoted PBl niB , PBIrf, PB1 mn , PB1 SC) 
and PBly/Mj2 can be used to screen viral isolates from patients to determine which HIV variant the 
clinical isolate most closely resembles. This "screening" can be done by a variety of known antibody, 
antigen binding techniques. 

30 Vaccines comprising one or more of the HIV proteins or peptides, disclosed herein, and 

variants thereof having antigenic properties, can be prepared by procedures well known in the art. 
For example, such vaccines can be prepared as injectables, eg., liquid solutions or suspensions. Solid 
forms for solution in, or suspension in, a liquid prior to injection also can be prepared. Optionally, 
the preparation also can be emulsified. The active antigenic ingredient or ingredients can be mixed 

35 with excipients which are pharmaceuUcaUy acceptable and compatible with the active ingredient. 
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Examples of suitable recipients are water, saline, dextrose, glycerol, etbanol, or the like, and 
combinations thereof. In addition, if desired, the vaccine can contain minor amounts of auxiliary 
substances such as wetting or emulsifying agents, pH buffering agents, or adjuvants such as aluminum 
hydroxide or muramyl dipeptJde or variations thereof. In the case of peptides, coupling lo larger 
5 molecules such as KLH sometimes enhances iramunogenidty. The vaccines are conventionally 

administered parenterally, by injection, for example, either subcutaneousry or Intramuscularly. 
Additional formulations which are suitable for other modes of administration include suppositories 
and, in some cases, oral formulations. For suppositories, traditional binders and carriers include, for 
example, poh/alkalene glycols or triglycerides. Suppositories can be formed from mixtures containing 

10 the active ingredient in the range of about 0.5% to about 10%, preferably about 1 to about 2%. Oral 

formulations can include such normally employed excipients as, for example, pharmaceutical grades 
of mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, 
and the like. These compositions can take the form of solutions, suspensions, tablets, pills, capsules, 
sustained release formulations or powders and contain from about 10% to about 95% of active 

15 ingredient, preferably from about 25% to about 70%. 

The compounds can be formulated into the vaccine as neutral or salt forms. Pharmaceutical ty 
acceptable salts include the acid addition salts (formed with the free amino groups of the peptide) and 
which are formed with inorganic acids such as, for example, hydrochloric or phosphoric acids, or such 
organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed with the free carboxyl 

20 groups can also be derived from inorganic bases such as, for example, sodium, potassium, ammonium, 

calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, 2-ethylamino 
ethanol, histidine, procaine, and the like. A vaccine composition may include peptides containing T 
helper cell epitopes in combination with protein fragments containing the principal neutralizing 
domain. Several of these epitopes have been mapped within the HTV envelope, and these regions have 

25 been shown to stimulate proliferation and h/mpholdne release from lymphocytes. Providing both of 

these epitopes in a vaccine may result in the stimulation of both the humoral and the cellular immune 
responses. 

Alternatively, a vaccine composition may include a compound which functions to increase the 
general immune response. One such compound is inierleuldn-2 (IL-2) which has been reported to 
30 enhance immunogenidty by general immune stimulation (Nunberg et al. (1988] In New Chemical and 

Genetic Approaches to Vacdnatlori. Cold Spring Harbor Laboratory, Cold Spring Harbor, NY). ILr 
2 may be coupled with an HIV peptide or protein comprising the PND to enhance the efficacy of 
vaccination. 

The vaccines are administered in a manner compatible with the dosage formulation, and in 
35 such amount as will be therapeutically effective and immunogenic. The quantity to be administered 
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depends on the subject to be treated, capacity of the subject's immune system to synthesize antibodies, 
and the degree of protection desired. Precise amounts of active ingredient required to be administered 
depend on the judgment of the practitioner and are peculiar to each individual However, suitable 
dosage ranges are of the order of about several hundred micrograms active ingredient per individual 
5 Suitable regimes for initial administration and booster shots are also variable, but are typified by an 

initial administration followed in one or two week intervals by a subsequent injection or other 
administration. 

HIV is known to undergo amino add sequence variation, particularly in the envelope gene 
(Starcich, B.R. [1986] Cell 45:637-648; Hahn, B.H. et at [1986] Science 232:1548-1553). Over 100 

10 variants have been analyzed by molecular cloning and restriction enzyme recognition analysis, and 

several of these have been analyzed by nucleotide sequencing. Some of these are the HIV isolates 
known as RF (Popovic, M. et al. [1984] Science 224:497-500), WMJ-1 (Hahn, B.R et aL [1986] 
Science 232:1548-1553), LAV (Wain-Hobson, S. et al. [1985] Cell 40:9-17), and ARV-2 (Sanchez- 
Pescador, R. et al. [1985] Science 227:484-492). HIV peptides from different viral isolates 

15 can be used in vaccine preparations to protect against infections by different HTV isolates. Further, 

a vaccine preparation can be made using more than one envelope protein fragment corresponding to 
the principal neutralizing domain of more than one HTV isolate to provide immunity and thus give 
better protection against AIDS. Alternatively, the vaccine preparation can be made using a single 
protein fragment that is comprised of a tandem arrangement of principal neutralizing epitopes from 

20 more than one HIV isolate. By identifying the principal neutralizing domain of HIV, this polypeptide 

region can be applied to formulate valuable vaccine, diagnostic, and therapeutic reagents. 

Antibodies to recombinant peptides disclosed herein are useful as therapeutic and prophylactic 
reagents. The generation of polyclonal or monoclonal antibodies capable of neutralizing a variety of 
HTV variants could be used to reduce the incidence of accidental infection and treat HTV infected 

25 people that are immuno-comprornised. Additionally, immunization regimens may elicit polyclonal sera 

capable of broadly neutralizing several variants of HIV. The ability to neutralize multiple HTV 
variants is termed broadly neutralizing antibody. Broadly neutralizing antibody may neutralize two or 
more HTV variants or all HTV variants. Therefore, a mixture of broadly neutralizing antibodies that 
neutralize different groups of HTV variants would be useful for diagnosis, prophylaxis, and therapy of 

30 AIDS. 

It is surprising and advantageous that immunization with peptides from five HTV variants 
would yield sera capable of neutralizing more than these five HrV variants when immunization wilb 
two docs not. Example 22 shows that immunization with five peptides elicits broadly neutralizing sera. 
Broadly neutralizing sera may also be generated if several sequences from the hypervariable region of 
35 diverse HTV variants are presented as a single synthetic peptide. Additionally, one may elicit this 
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broadly neutralizing sera by reimmunization of animals primed with RP136 or equivalent proteins with 
peptides containing only the conserved amino acids within this hypervariable region. These 
immunization regimens would be useful for vaccination and for deriving antibodies useful as 
therapeutic agents. 

5 Polyvalent immune globulin for use in passive immunization can be prepared by immunization 

of horses or by pooling immune human sera and fractionation of the IgG component from plasma or 
sera. Human or mouse monoclonal antibody producing cell lines may be prepared by standard 
transformation and hybridoma technology (Methods in PnTymn^n gY, VoL 121, Sections I and II [1986] 
eds, J J. Langone and H.V. Vunakis, Academic Press). HIV monoclonal antibody can be prepared in 
10 accord with the procedures disclosed by Matsushita et al. (Matsushita et al. [1988] Journal of Virology 

62(6)^107-2114). Since, for the most part, monoclonal antibodies are produced in species other than 
humans, they are often immunogenic to humans. In order to successfully use these monoclonal 
antibodies in the treatment of humans, it may be necessary to create a chimeric antibody molecule 
wherein the portion of the polypeptide involved with ligand binding (the variable region) is derived 
15 from one species, and the portion involved with providing structural stability and other biological 

functions (the constant region) is derived from a human antibody. Methods for producing chimeric 
antibodies in which the variable domain is derived from one host and the constant domain is derived 
from a second host are well known to those skilled in the art See, for example, Neuberger et aL, 
WO Publication No. 86/01533, priority 9/3/84; Morrison et aL, EP Publication No. 0 173 494, priority 
8/27/84. An alternative method, in which an antibody is produced by replacing only the 
complementarity determining regions (CDRs) of the variable region with the CDRs from an 
immunoglobulin of the desired antigenic specificity, is described by Winter (OB Publication No. 2 188 
638, priority 3/27/86). Murine monoclonals can be made compatible with human therapeutic use by 
producing an antibody containing a human Fc portion (Morrison, S.L. {1985] Science 229:1202-1207). 
Established procedures would allow construction, expression, and purification of such a hybrid 
monoclonal antibody. Regimens for administering immune globulin therapeutically have previously 
been used for a number of infectious diseases. 

As used herein, the term 'antibody" is meant to encompass monoclonal or polyclonal 
antibodies, whole, intact antibodies or antibody fragments having the immunological activity of the 
whole antibody. Also encompassed within the term 'antibody- arc chimeric antibodies having the 
variable and constant regions from different host species, or those wherein only the CDRs are 
replaced. 

For treatment of HIV infection, compositions comprising antibodies may be administered to 
an individual or animal in need of treatment. Alternatively, the HIV antigens described here may be 
administered in order to stimulate the recipient's own immune response. When treating with an HIV 
antigen, a single antigen may be administered or, preferably, a broadly neutralizing antigen or mixture 
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of antigens may be administered. Such compositions are described in detail in the examples which 
follow. 

The ability to modify peptides made by organic synthesis can be advantageous for diagnostic, 
therapeutic, and prophylactic use by improving efficiency of immobilization, Increasing protein stability, 
5 increasing immunogenicity, altering immunogenicity, reducing toxicity, or allowing multiple variations 

simultaneously. For example, peptides can be modified to increase or decrease net charge by 
modification of amino or carboxyl groups (carbamylation, trifluoroacetylation or succinylation of amino 
groups; acetylation of carboxyl groups). Peptides can be made more stable by, for example, inclusion 
of D-amino acids or tfrcularization of the peptide. Reductive state of peptides can be altered by, for 

10 example, sulfonation of cystinyl groups. Peptides can also be modified covalently or non-covalenlly 

with non-protcinaceous materials such as lipids or carbohydrates to enhance immunogenicity or 
solubility. Polyethylene glycol can be used to enhance solubility. The subject invention includes all 
such chemical modifications of the proteins and peptides disclosed herein so long as the modified 
protein and/or peptide retains substantially all the antigenic/immunogenic properties of the parent 

15 compound. 

Peptides can also be modified to contain antigenic properties of more than one viral variant. 
This has been done, for example, with Foot and Mouth Disease virus. 

Foot and Mouth Disease virus is similar to HIV in that multiple variants exist and 
immunization with one variant does not lead to protection against other variants. The real utility of 
20 peptides as immunogens is demonstrated by eliciting immunity to more than one variant by 

modification of the peptide to possess properties of both natural variants. When such a modified 
variant was used to immunize test animals, they were protected against both Foot and Mouth vims 
strains A10 and A12 (Brown, F. in Vims Vaccines, ed. G. Dreesman, J. Bronson, R. Kennedy, pp. 49- 
54 [1985]). 

25 HIV peptides or proteins containing a PND epitope can also be coupled with or incorporated 

into an unrelated virus particle, a replicating virus, or other microorganism in order to enhance 
immunogenicity. The HTV epitope may be genetically or chemically attached to the viral panicle or 
microorganism or an immunogenic portion or component thereof. Antigenic epitopes have been 
attached to viral proteins or particles to enhance the immune response. For example, the VP6 capsid 

30 protein of rotavirus has been used as an immunologic carrier protein for an epitope of interest either 

in the monomelic form or as oligomers of VP6 in the form of particles (EP Publication No. 0 259 
149). Similarly, Evans et al. (1989, Nature 339385) have constructed chimaeras of the poliovirus 
capsid protein and an epitope of Hrv gp41 to enhance immunogenicity of the HIV epitope. Foreign 
antigenic determinants have also been expressed and presented by bacterial cells. A Salmonella strain 

35 expressing a cloned Salmonella flagellin gene, into which was inserted an epitope of cither cholera 
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toxin or hepatitis B surface antigen, was reported to elicit both cellular and humoral responses to the 
inserted epitopes (Newton et aL [1989] Science 244:70-72; and Wu et aL [1989J Proa Natl. Acad. ScL 
86:4726-4730). 

Example 18 shows that a peptide containing amino acid sequences from two HIV variants can 
5 block virus neutralization activity of two virus specific neutralizing antisera. This suggests that a 

peptide or protein containing sequences of two or more HIV variants can elicit an immune response 
effective against two or more HIV variants. 

Example 19 shows that co-immunization with envelope proteins from two HIV isolates elicits 
an immune response capable of neutralizing two HIV isolates. This suggests that co-immunization 
10 with proteins from two or more HIV variants can elicit an immune response effective against two or 

more HTV variants. 



Following arc examples which illustrate the process of the invention, including the best mode. 
These examples should not be construed as limiting. All solvent mixture proportions are by volume 
15 unless otherwise noted. 



Example 1 - Construction of plasmid PREV2.2 

The pREVZ2 plasmid expression vector was constructed from plasmid pBOl. Plasmid pBGl 
can be isolated from its E. coli host by well known procedures, e.g^ using cleared rysate-isopyenic 
20 density gradient procedures, and the like, like pBGl, pREV2_2 expresses inserted genes behind the 

IL coli promoter. The differences between pBOl and pREV2-2 are the following: 

1. pREVZ2 lacks a functional replication of plasmid (Top) protein. 

2. pREV2-2 has the ttpA transcription terminator inserted into the Aatll site. This 
sequence insures transcription termination of over-expressed genes. 

25 3. pREV2-2 has genes to provide resistance to ampicfllin and chloramphenicol, whereas 

pBGl provides resistance only to ampicfllin. 
4. pREV2.2 contains a sequence encoding sites for several restriction endo nucleases. 
The following procedures were used to make each of the four changes listed above: 
la. 5 jig of plasmid pBGl was restricted with £WeI, which gives two fragments of 
30 approximately 2160 and 3440 base pairs. 

lb. 0.1 fig of DNA from the digestion mixture, after inactrvation of the £4dei, was treated 
with T4 DNA ligase under conditions that favor intramolecular ligation (200 p*\ 
reaction volume using standard T4 ligase reaction conditions [New England Biolabs, 
Beverly, MA]). Intramolecular ligation of the 3440 base pair fragment gave an 
ampidlUn resistant plasmid. The ligation mixture was transformed into the recipient 



35 



PCT/US89/04302 



26 

strain E. co|i JM1Q3 (available from New England Biolabs) and ampicillin resistant 
clones were selected by standard procedures. 

The product plasmid, pBGlAN, where the 2160 base pair Nde l fragment is deleted 
from pBGl, was selected by preparing plasmid from ampicillin resistant clones and 
determining the restriction digestion patterns with Nde l and £ajl (product fragments 
approximately 1790 and 1650). This deletion inactivates the rjoj> gene that controls 
plasmid replication. 

5 fig of pBGl N was then digested with EcoRI and ggll and the larger fragment, 
approximately 24S5 base pairs, was isolated. 

A synthetic double stranded fragment was prepared by the procedure of Itakura et al. 

(ltakura, K., JJ. Rossi, and R.B. Wallace [1984] Ann. Rev. Biochem. 53:323-356, and 

references therein) with the following structure: 

5' G ATCAAQCITCTGCAGTCOACG CAT 

3' TTCGAAG ACGTCAG CTGCGTACGCC 

GCGG ATCCG GTACCCG GG AG CTCG V 
TAG G CCATG G G CCCTCG A G CTTAA 5' 

This fragment has Bell and EcoRI sticky ends and contains recognition sequences for 

several restriction endonucleases. 

0.1 /ig of the 2455 base pair EcoRI-BclI fragment and 0.01 fig of the synthetic 
fragment were joined with T4 DNA ligase and competent cells of strain JM1Q3 were 
transformed. Cells harboring the recombinant plasmid, where the synthetic fragment 
was inserted into pBGlAN between the Bell and EcoRI sites, were selected by 
digestion of the plasmid with Hpal and J=coRI. The diagnostic fragment sizes are 
approximately 2355 and 200 base pairs. This plasmid is called pREVl. 
5 fig of pREVl were digested with AatU, which cleaves uniquely. 
The following double-stranded fragment was synthesized: 
5* CGGTACCAGCCCGCCTAATG 
3' TGCAGCCATGGTCGGGCGGA 

AGCGGGCTTTTTTTTGACGT3' 
TTACTCG CCCG AAAAAAAAC 5* 
This fragment has AatU sticky ends and contains the trpA transcription termination 
sequence. 

0.1 fig of AatU digested pREVl was ligated with 0.01 fig of the synthetic fragment in 
a volume of 20 fi\ using T4 DNA ligase. 
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2g. Cells of strain JM103, made competent, were transformed and ampicillin resistant 
clones selected, 

2h. Using a Kpnl, EcqRI double restriction digest of plasmid isolated from selected 
colonies, a cell containing the correct construction was isolated. The sizes of the 
5 Kpn l. EcoRI generated fragments are approximately 2475 and 80 base pairs. This 

plasmid is called pREVITT and contains the trpA transcription terminator. 
3a. 5 fig of pREVITT, prepared as disclosed above (by standard methods) was cleaved 

with Nde l and Xmn I and the approximately 850 base pair fragment was isolated. 
3b. 5 fig of plasmid pBR325 (BRL, Gaithersburg, MD), which contains the genes 
10 conferring resistance to chloramphenicol as well as to amplcfllin and tetracycline, was 

cleaved with gcll and the ends blunted with Klenow polymerase and dexoynucleoiides. 
After inactivating the enzyme, the mixture was treated with Nde l and the 
approximately 3185 base pair fragment was isolated. This fragment contains the genes 
for chloramphenicol and ampicillin resistance and the origin of replication. 
15 3c. 0.1 fig of the Ndel- Xmn I fragment from pREVITT and the Ndel-Bcll fragment from 

pBR325 were ligated in 20 with T4 DNA ligase and the mixture used to transform 
competent cells of strain JM103. Cells resistant to both ampicillin and 
chloramphenicol were selected. 
3d. Using an EcoRI and Nde l double digest of plasmid from selected clones, a plasmid 
was selected giving fragment sizes of approximately 2480, 1145, and 410 base pairs. 
This is called plasmid pREVlTT/chl and has genes for resistance to both ampidllin 
and chloramphenicol 
4a. The following double-stranded fragment was synthesized: 
Mlul EcoRV dal BamHI Sail Hindni Smal 
25 5' COAACG CGTGGCCO ATATCATCOATOG - 

ATCCOTCGACAAGCTTCCCGGOAGCT 3" 
3* GCTTG CGCACCGGCTATAGTAGCTAC- 

CTAGGCAGCTGTTCGAAGGGCCC 5 1 
This fragment, with a blunt end and an Sstl sticky end, contains recognition sequences 
for several restriction enzyme sites. 
4b. 5 fig of pREVlTT/chl was cleaved with Nru l (which cleaves about 20 nucleotides from 
the Bell site) and gstl (which cleaves within the multiple cloning site). The larger 
fragment, approximately 3990 base pairs, was isolated from an agarose geL 
4c 0.1 fig of the Nrul-Sstl fragment from pREVlTT/chl and O01 fig of the synthetic 
fragment were treated with T4 DNA ligase in a volume of 20 
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4d This mixture was transformed into strain JM103 and arapicillin resistant clones were 
selected. 

4e. Plasmid was purified from several clones and screened by digestion with Mlu l or Oa l. 
Recombinant clones with the new multiple cloning site will give one fragment when 
5 digested with either of these enzymes, because each cleaves the plasmid once. 

4f. The sequence of the multiple cloning site was verified. This was done by restricting 
the plasmid with HpaJ and PyuII and isolating the 1395 base pair fragment, cloning 
it into the Sma l site of mpl8 and sequencing it by dideozynucleotide sequencing using 
standard methods. 
10 4g. This plasmid is called pREV2-2. 



Example 2 - Construction of the bacterial expression vector pREVZl 

Plasmid pREV2.1 was constructed using plasmid pREV2_2 and a synthetic oligonucleotide. 
The resulting plasmid was used to construct pPBl-Sub 1 and pPBl-Sub 2. 
15 An example of how to construct pREVXl is as follows: 

1. Plasmid pREV23 is cleaved with restriction enzymes Nrul and Bam H] and the 4 Kb 
fragment is isolated from an agarose gel. 

2. The following double-stranded oligonucleotide is synthesized: 

20 5» COAACOCOTGGTCCGATATCATCGATO 3' 

3' GCTTGCG CACCAGG CTATAGTAO CTACCTAG 5* 

3. The fragments from 1 and 2 are ligated in 20 /il using T4 DNA ligase, transformed 
into competent E. coli cells and chloramphenicol resistant colonies are isolated 

4. Plasmid clones are identified that contain the oligonucleotide from 2. spanning l he 
25 region from the Nrul site to the BamHI site and recreating these two restriction sites. 

This plasmid is termed pREV2.1. 



Example 3 - Construction of and expression from plasmid oPBl-Sub 1 

Plasmid pPBl-Sub 1, which contains approximately 165 base pairs (bp) of DNA encoding 
essentially the HIV eny gene from the PyuII site to the Dral site, and from which is synthesized an 
approximately 12 Kd fusion protein containing this portion of the gpl20 envelope protein can be 
constructed as follows: 

1. Restricting plasmid pPBlrgg with M]ul and Dral and isolating the approximately 165 
bp fragment. 
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2. Restricting plasmid pREVZl with Mlul and Sma l and isolating the Urge fragment, 
approximately 4 Kd, from an agarose geL 

3. Ligating the fragment prepared in 2. with the pREVZl fragment in a volume of 20 
fi\ using T4 DNA ligase, transforming the ligation mixture into competent cell strain 

5 CAG629, and selecting ampidllin-resistant transformants. 

4. Selecting such transformants, by appropriate restriction patterns, that have the gpl20 
fragment cloned in the proper orientation to generate a fusion protein. When the 
strain harboring this recombinant plasmid is grown in 2% medium containing 50/<g/ml 
ampicillin and the total complement of cellular proteins are eleciropboresed on a 

10 SDS-poh/acrylamide gel, a protein of approximately 12 Kd can be visualized by either 

coomassie blue staining or by Western blot analysis using as probe selected sera from 
HIV infected individuals. 

Example 4 - Purification of recom binant protein containing HTV envelope sequences from plasmid 
15 pPBl-Sub 1 

1. Growth of cells: Cells were grown in a 10-liier volume in a Chemap (Chemapec, 
Woodbury, NY) fermentor in 2% medium (2% yeast extract, bacto-trypione, casamino 
acids pifco, Detroit, MI], 0.2% potassium monobasic, 0.2% potassium dibasic, and 
02% sodium dibasic). Fermentation temperature was 30°C, the pH was 6.8, and air 
was provided at 1 wm. Plasmid selection was provided 20 jig/ml chloramphenicol. 
Typical cell yield (wet weight) was 30 g/L 

2. Cell h/sis: 50 g, wet cell weight, of & poli containing the recombinant HIV envelope 
fusion protein were resuspended in a final volume of 100 ml in 50 mM Tris-Q pH 
8.0, 5 mM potassium ethylenediamineietraacetic acid (KEDTA), 5 mM diibioihreilol 
(DTT), 15 mM 0-mercaptoethanol, 05% TRITON™X-100 (Pharmacia, Piscataway, 
NJ), and 5 mM phenylmethylsulfonyl fluoride (PMSF). The suspension was incubated 
for 30 min at room temperature. 

This material was hysed using a BEAD-BEATER™ (Biospec Products, 
Bartlesville, OK) containing an equal volume of 0.1-0.15 mm glass beads. The lysis 
was done for 6 min at room temperature in 1-min intervals. The liquid was separated 
from the beads and centrifuged for 25 hr at 20,000 xg. The supernatant was removed 
and the pellet was resuspended in 100 ml 8 M urea, 20 mM Tris-d pH 8.0, 5 mM 
DTT, 15 mM /S-mercaptocthanol, 5 mM PMSF, and 1 mM KEDTA. The pellet was 
solubilized using a polytron bomogenizer and centrifuged at 20,000 xg for 2 hr. 
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3. CM Chromatography: The dialysate was loaded onto a 100 ml column (2-5 era x 20 
cm) packed with CM FAST FLOW SEPHAROSE™ (Pharmacia) equilibrated in 8 
M urea, 10 mM 4-(2-hydroxyethyl-l-piperazine ethane-sulfonic add (HEPES) pH 6.5, 
15 mM ^-mercaptoethanol, and 1 mM KEDTA at room temperature. The column 

5 was washed with 200 ml equilibration buffer, and the protein eluted with a 1.0 liter 

linear gradient from 0-0.4 M NaCl. The HIV protein (12 Kd) eluted at approximately 
0.2 M NaCl as assayed by SDS-polyacrylamide gel electrophoresis. 

Further purification was obtained by pooling Sub 1 -containing fractions and 
applying to a S-200 (Pharmacia) gel filtration column equilibrated in the same buffer 

10 as the previous column. 



Example 5 - Construction of and expression from plasmid pPBl-Sub 2 

Plasmid pPBl-Sub 2, which contains approximately 320 bp of DNA encoding essentially the 
HTV eny gene from the PvuII site to the Seal site, and from which is synthesized an approximately 
15 18 Kd fusion protein containing this portion of the gpl20 envelope protein, can be constructed as 

follows: 

1. Restricting the pPBlma plasmid with Mlu l and Seal and isolating the approximately 
320 bp fragment 

2. Restricting plasmid pREVZl with Mlu l and Sma l and isolating the large fragment, 
20 approximately 4 Kd, from an agarose gel 

3. Ugating the fragment prepared in 2. with the pREVZl fragment in a volume of 20 
fi\ using T4 DNA ligase, transforming the ligation mixture into competent cell strain 
SG20251 and selecting ampidllin -resistant transformants. 

4. Selecting such transformants, by appropriate restriction patterns, thai have the gp!20 
25 fragment cloned in the proper orientation to generate a fusion protein. When the 

strain harboring this recombinant plasmid is grown in 2% medium containing 50/ig/ml 
ampidllin and the total complement of cellular proteins electropboresed on an SDS- 
poh/acrylamide gel, a protein of approximately 18 Kd can be visualized by either 
coomassie blue staining or by Western blot analysis using as probe selected sera from 
30 HIV infected individuals. 



Example 6 - Purification of recomb inant protein containing HTV envelope sequences from plasmid 
pPBl-Sub 2 

1. Growth of cells: Cells were grown in a 10-liter volume in a Chemap fermentor in 2% 
35 medium. Fermentation temperature was 37°C, the pH was 6A, and air was provided 
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at 1 vm Plasmid selection was provided by 20/ig/ml chloramphenicol. Typical cell 
yield (wet weight) was 30 g/1. 
Z Cell lysis: 50 g, wet cell weight, of E, coh containing the recombinant HIV envelope 
fusion protein were resuspended in a final volume of 100 ml in 50 mM Tris-Cl pH 

5 8.0, 5 mM KEDTA, 5 mM DTT, 15 mM ^-mercaptoeihanol, 0.5% TRITON™X- 

100, and 5 mM PMSP. The suspension incubated for 30 min at room temperature. 
This material was fysed using a BEAD-BEATER™ containing an equal 
volume of 0.1-0.15 mm glass beads. The lysis was done for 6 min at room 
temperature in 1 min intervals. The liquid was separated from the beads and 

0 centrifuged for 2-5 hr at 20,000 xg. The supernatant was removed and the pellet was 

resuspended in 100 ml 6 M guanidine-bydrochloridc, 20 mM Tris-Cl pH 8.0, 5 mM 
DTT, 15 mM 0-mercaptoethanol, 5 mM PMSF, and 1 mM KEDTA. The pellet was 
solubilized using a potytron homogenizer and centrifuged at 20,000 xg for 2 hr. 

The supemate (90 ml) was dialyzed against 4 liters of 8 M urea, 20 mM 

5 sodium formate, pH 4.0, 1 mM EDTA, and 15 mM ^-mercaptoelhanoL Dialysis was 

done each time for 8 hr or longer with three changes of buffer. Spectraphor dialysis 
tubing (S/P, McOraw Park, IL) with a 3-5 Kd MW cut-off was used. 
3. CM Chromatography: The diarysate was loaded onto a 100 ml column (2^ cm x 20 
cm) packed with CM FAST FLOW SEPHAROSE™ (Pharmacia) equilibrated in 8 

0 M urea, 20 mM sodium formate pH 4.0, 15 mM 0-mercaptoethanol, and 1 mM Na 

EDTA at room temperature. The column was washed with 200 ml equilibration 
buffer, and the protein eluted with a 1.0 liter linear gradient from 0-0.4 M Nad. The 
HIV protein (18 Kd) eluted at approximately 0.2 M Nad as assayed by SDS- 
polyacrylamide gel electrophoresis. 

5 Further purification was obtained by pooling Sub 2-coniaining fractions and 

applying to an S-200 (Pharmacia) ge! filtration column equilibrated in the same buffer 
as the previous column. 



Example 7 - Synthetic peptides 

Synthesis of peptides can be done by a variety of established procedures, for example, 
automated peptide synthesis. Peptides were assembled by solid-phase synthesis on cross-linked 
polystyrene beads starting from the carboxyl terminus and adding amino acids in a step-wise fashion 
(Merrifield, R.B. [1963] S. Am. Chem, Soc 85:2149). Each synthesis was performed on au automated 
peptide synthesizer (Applied Biosystems 430-A) using standard t-Boc chemistry. Amino adds were 
coupled as highly reactive symmetric anhydrides formed immediately prior to use. To minimize 
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coupling difficulties, dimethylformamide was used as the coupling buffer. The quamitative ninhydrin 
assay was used to measure the efficiency of coupling after each amino acid addition (Sarin, VJC, 
S.B.H. Kent, J.P. Tarn, R.B. Merrifield [1981] Anal. Biochem. 117:147 1981). 

All peptides were deprotected and cleaved from the polystyrene support using an alternative 
5 to HF cleavage. Resin containing peptide was resnspended in a mixture of trifluoroacetic acid, 

trifluorometbane sulfonic acid, and organic thiol scavengers (Tarn, J.P., W.F. Heath, TLB. Merrifield 
[1986] J. Am. Chem. Soc. 108:5242). Soluble peptide was precipitated with ethyl eiber and, after 
removing ether, resuspended in 200 mM sodium carbonate, 3 M guanidine HQ. The crude peptides 
were purified by reverse-phase chromatography on a 1.0 cm x 25 cm Vidac semi-prcparaiive Cj 8 

10 column. The buffers employed were: (A) 0.1% trifluoroacetic acid in H 2 0, and (B) 0.1% 

trifluoroacetic acid in 80% aceto nit rile/20% H 2 CX Gradient elution was utilized to elute the bound 
peptide and collected fractions were further analyzed to identify pure product. Peptide identity was 
confirmed by amino acid analysis following 6 N Ha hydrolysis. The synthesis included the addition 
of terminal amino adds not homologous to HTV for purposes of labeling, cross-linking, or structure 

15 of the peptide. These non-HJV amino acids are indicated in parenthesis. 

The product of synthesis can be further purified by a number of established separaiory 
techniques, for example, ion exchange chromatography. 

Example 8 - Construction of and expression from plasmid pPBlp p 
20 Plasmid pPBlRp, which contains approximately 565 bp of DNA encoding essentially the 

HIV RF eny gene from the JPyuII site to the Bel li site, and from which is synthesized an approximately 
27 Kd fusion protein containing this portion of the gpl20 envelope protein can be constructed as 
follows: 

1. Synthesizing DNA fragment in Table 4 A. 

25 Z Restricting plasmid pREV2.2 with EcoRV and Bam HI and isolating the large 

fragment, approximately 4 Kd, from an agarose get 
3. legating the fragment prepared in 1. with the pREVZ2 fragment in a volume of 20 
/t\ using T4 DNA ligase, transforming the ligation mixture into competent cell strain 
CAG 629, and selecting ampicillin-resistant transformants. 

30 4. Selecting such transformants, by appropriate restriction patterns, that have the gpl20 

fragment cloned in the proper orientation to generate a fusion protein. When the 
strain harboring this recombinant plasmid is grown at 32°C in 2% medium containing 
50 figfml ampirillin and the total complement of cellular proteins electrophorescd on 
an SDS-polyacrylamide gel, a protein of approximately 27 Kd can be visualized by 
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either coomassie blue staining or by Western blot analysis using as probe selected sera 
from HIV infected individuals. 



Example 9 - Purificati on of recombinant protein containing HTV envelope sequences from plasm id 

1. Growth of cells: Cells were grown in a 10-liter volume in a Chemap fermentor in 2% 
medium. Fermentation temperature was 30°C, the pH was 6.8, and air was provided 
at 1 win. Plasmid selection was provided by 20 /ig/ml chloramphenicol. Typical cell 
yield (wet weight) was 30 g/L 
10 2- Cell lysis: 50 g, wet cell weight, of coli containing the recombinant HIV envelope 

fusion protein were resuspended in a final volume of 100 ml in 50 mM Tris-Q pH 
8.0, 5 mM KEDTA, 5 mM DDT, 15 mM 0-mercaptoethanol, 05% TRITON™X- 
100, and 5 mM PMSF. 300 mg lysozymc was added and the suspension incubated for 
30 min at room temperature, 
15 This material was lysed using a BEAD-BEATER™ containing an equal 

volume of 0.1-0.15 mm glass beads. The lysis was done for 6 min at room 
temperature in 1 min intervals. The liquid was separated from the beads and 
centrifuged for 2.5 hr at 20,000 xg. The supernatant was removed and the pellet was 
resuspended in 100 ml 8 M urea, 20 mM Tris-Q pH 80, 5 mM DTT, 15 mM /3- 
mercaptoethanol, 5 mM PMSF, and 1 mM KEDTA. The pellet was solubilized using 
a poh/tron homogenizer and centrifuged at 20,000 xg for 2 hr. 

The superaate (90 ml) was diafysed against 4 liters of 8 M urea, 20 mM 
HEPES, pH 65, 1 mM EDTA, and 15 mM £-mercaptoethanol. Dialysis was done 
each time for 8 hr or longer with three changes of buffer. Speciropbor dialysis tubing 
25 with a 3.5 Kd MW cut-off was used. 

3. CM Chromatography: The diajysate was loaded onto a 100 ml column (US cm x 20 
cm) packed with CM FAST FLOW SEPHAROSE™ equilibrated in 8 M urea, 10 
mM HEPES pH 6.5, 15 mM ^-mercaptoethanol, and 1 mM Na EDTA at room 
temperature. The column was washed with 200 ml equilibrium buffer, and the protein 
eluted with a 1.0 liter linear gradient from 0-0.4 M Nad. The HTV protein (26 Kd) 
elutcd at approximately 0.2 M NaCl as assayed by SDS-polyacrylamide gel 
electrophoresis. 

Further purification was obtained by pooling PBl RF containing fractions and 
applying to an S-300 gel filtration column equilibrated in the same buffer as ihc 
35 previous column. 
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Example 10 - Construction of and expression from plasm id pPBlm n 

Plasmid pPBImn. which contains approximately 600 bp of DNA encoding essentially the 
HTV MN env gene from the Bel li site to the Bel li site, and from which is synthesized an approximately 
5 28 Kd fusion protein containing this portion of the gpl20 envelope protein, can be constructed as 

follows: 

1. Synthesizing DNA fragment in Table 5 A. 

2. Restricting plasmid pREV2.2 with Bam HL 

3. Ugating the fragment prepared in 1. with the pREV2.2 fragment in a volume of 20 
10 jUl using T4 DNA ligase, transforming the ligation mixture into competent cell strain 

CAG 629, and selecting ampicillin-resistant transfonnants. 

4. Selecting such transfonnants, by appropriate restriction patterns, that have the gp!20 
fragment cloned in the proper orientation to generate a fusion protein. When the 
strain harboring this recombinant plasmid is grown at 32°C in 2% medium containing 

15 50 ftg/ml ampidllin and the total complement of cellular proteins electrophoresed on 

an SDS-polyacrylainide gel, a protein of approximately 28 Kd can be visualized by 
either coomassie blue staining or by Western blot analysis using as probe selected sera 
from HIV infected individuals. 

20 Example 11 - Purification of reco mbinant protein containing HIV envelope sequences from plasmid 

1. Growth of cells: Cells were grown in a 10-liter volume in a Cbemap fermentor in 2% 
medium. Fermentation temperature was 30°C, the pH was 6.8, and air was provided 
at 1 wm. Plasmid selection was provided by 20 /Jg/ml chloramphenicol. Typical cell 

25 yield (wet weight) was 30 g/L 

2. Cell lysis: 50 g, wet cell weighs of E. coli containing the recombinant HIV envelope 
fusion protein were resuspended in a final volume of 100 ml in 50 mM Tris-a pH 
8.0, 5 mM KEDTA, 5 mM DTT, 15 mM ^-mercaptoethanol, 0-5% TRITON™X- 
100, and 5 mM PMSF. 300 rag lysozyme was added and the suspension incubated for 

30 30 min at room temperature. 

This material was lysed using a BEAD- BEATER™ containing an equal 
volume of 0.1-0.15 mm glass beads. The lysis was done for 6 min ai room 
temperature in 1 min intervals. The liquid was separated from the beads and 
centrifuged for 2-5 hr ai 20,000 xg. The supernatant was removed and the pellet was 

35 resuspended in 100 m! 8 M urea, 20 mM Tris-Q pH 8.0, 5 mM DTT, 15 mM fi- 
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mercaptoethanol, 5 mM PMSF, and 1 mM KEDTA. The pellet was solubilized using 
a polyiron homogenizer and cemrifuged at 20,000 xg for 2 hr. 

The superaate (90 ml) was diatysed against 4 liters of 8 M urea, 20 mM 
HEPES, pH 65, 1 mM EDTA, and 15 mM £-mercaptoethanoL Dialysis was done 
each time for 8 hr or longer with three changes of buffer. Spectraphor dialysis tubing 
with a 3.5 Kd MW cut-off was used, 

3. CM Chromatography: The diarysate was loaded onto a 100 ml column (2.5 cm x 20 
cm) packed with CM FAST FLOW SEPHAROSE™ equilibrated in 8 M urea, 10 
mM HEPES pH <L5, 15 mM 0-mercaptoetbanol, and 1 mM KEDTA at room 
temperature. The column was washed with 200 ml equilibration buffer, and the 
protein eluted with a 1.0 liter linear gradient from 0-0.4 M NaCl. The HIV protein 
(28 Kd) elmed at approximately 0.2 M Nad as assayed by SDS-polyacrylamide gel 
electrophoresis. 

Further purification was obtained by pooling PBl ^-containing fractions and 
15 applying to an S-300 gel filtration column equilibrated in the same buffer as the 

previous column. 

Example 12 - Construction of and expression from plasmid pPBl ?r 

Plasmid pPBl sc> which contains approximately 570 bp of DNA encoding essentially the H1V SC 
20 env gene from the PyuII site to the Bel li site, and from which Is synthesized an approximately 26 Kd 

fusion protein containing this portion of the gpl20 envolope protein, can be constructed as follows: 

1. Synthesizing DNA fragment in Table 6A. 

2. Restricting plasmid pREV2.2 with EcoRV and BamHI and isolating the large 
fragment, approximately 4 Kd, from the agarose gel 

25 3. Li gating the fragment prepared in 1. with the pREV23 fragment in a volume of 20 

fi\ using T4 DNA Ugase, transforming the ligation mixture into competent cell strain 
CAG 629, and selecting ampicOlin-res islam tranaformants. 

4. Selecting such transfonnants, by appropriate restriction patterns, that have the gpl20 
fragment cloned in the proper orientation to generate a fusion protein. When the 

30 strain harboring this recombinant plasmid is grown at 32°C in 2% medium containing 

50/^g/ml ampicillin and the total complement of cellular proteins electrophoresed on 
an SDS-poryacrylamide gel, a protein of approximately 26 Kd can be visualized by 
either coomassie blue staining or by Western blot analysis using as probe selected sera 
from HIV infected individuals. 

35 
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Example 13 - Purification of recombinant protein containing HIV envelo pe sequences from plasmid 
JlPBlsc 

1. Growth of cells: Cells were grown in a 10-liter volume in a Ctaemap fermentor in 2% 
medium Fermentation temperature was 30°C, the pH was 6.8, and air was provided 

5 at 1 win. Plasmid selection was provided by 20 /igAnl chloramphenicol. Typical cell 

yield (wet weight) was 30 g/L 

2. Cell lysis: 50 g, wet cell weight, of g. coli containing the recombinant HIV envelope 
fusion protein were resuspended in a final volume of 100 ml in 50 mM Tris-Cl pH 
8.0, 5 mM KEDTA, 5 mM DTT, 15 mM 0-mercaptoethanoI, 0.5% TRITON™X- 

10 100, and 5 mM PMSR 300 rag tysozyme was added and the suspension incubated for 

30 min at room temperature. 

This material was rysed using a BEAD-BEATER™ containing an equal 
volume of 0.1-0.15 mm glass beads. The lysis was done for 6 min at room 
temperature in 1 min intervals. The liquid was separated from the beads and 

15 centrifuged for 2J5 hr at 20,000 xg. The supernatant was removed and the pellet was 

resuspended in 100 ml 8 M urea, 20 mM Tris-Cl pH 8.0, 5 mM DTT, 15 mM p- 
mercaptoethanol, 5 mM PMSF, and 1 mM KEDTA. The pellet was solubilized using 
a polytron homogenizer and centrifuged at 20,000 xg for 2 hr. 

The supernate (90 ml) was dialysed against 4 liters of 8 M urea, 20 mM 

20 HEPES, pH 6.5, 1 mM EDTA, and 15 mM 0-mercaptoelhanol and 1 mM KEDTA 

at room temperature. The dialysate was loaded onto a 100 ml column (2-5 cm x 20 
cm) packed with CM FAST FLOW SEPHAROSE™ equilibrated in 8 M urea, 10 
mM HEPES pH 6.5, 15 mM 0-mercaptoethanol, and 1 mM KEDTA at room 
temperature. The column was washed with 200 ml equilibration buffer, and the 

25 protein eluted with a 1.0 liter linear gradient from 0-0.4 M NaCL The HIV protein 

(26 Kd) eluted at approximately 0.2 M NaCl as assayed by SDS-potyacrylamide gel 
electrophoresis. 

Further purification was obtained by pooling PBl sc -containing fractions and 
applying to an S-300 gel filtration column equilibrated in the same buffer as the 
30 previous column. 

Example 14 - Construction of and expression from plasmid pPBl wm-) 

Plasmid pPBlwMji which contains approximately 560 bp of DNA encoding essentially the 
HTV WMJ2 ejiv gene and from which is synthesized an approximately 26 Kd fusion protein containing 
35 this portion of the gpl20 envelope protein, can be constructed as follows: 
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1. Synthesizing DNA fragment in Table 7A. 

2. Restricting plasmid pREV2.2 with EcoRV and Bam HI and isolating the large 
fragment, approximately 4 Kd, from an agarose get 

3. Ligating the fragment prepared in 1. with the pREV22 fragment in a volume of 20 
5 jul using T4 DNA ligase, transforming the ligation mixture into competent cell strain 

CAG 629, and selecting ampidllin-resistant transformants. 

4. Selecting such transformants, by appropriate restriction patterns, that have the gpl20 
fragment cloned in the proper orientation to generate a fusion protein. When the 
strain harboring this recombinant plasmid is grown at 32°C In 2% medium containing 

10 50 ^g/ral ampicfllin and the total complement of cellular proteins electrophoresed on 

an SDS-poryacryiamide gel, a protein of approximately 26 Kd can be visualized by 
either coomassie blue staining or by Western blot analysis using as probe selected sera 
from HIV infected individuals. 

15 Example 15 - Purification of recombinant protein containing HTV envelope sequences from plasmid 

E£E1wmJ2 

1. Growth of cells: Cells were grown in a 10-liter volume in a Chemap fermentor in 2% 
medium. Fermentation temperature was 30°C, the pH was 6.8, and air was provided 
at 1 wm. Plasmid selection was provided by 20 yug/ml chloramphenicol. Typical cell 

20 yield (wet weight) was 30 g/L 

2. Cell lysis: 50 g, wet cell weight, of E. colt containing the recombinant HIV envelope 
fusion protein were resuspended in a final volume of 100 ml in 50 mM Tris-Cl pH 
8.0, 5 mM KEDTA, 5 mM DTT, 15 mM 0-mercaptoelhanol, 03% TRITON™ X- 
100, and 5 mM PMSF. 300 mg tysozyme was added and the suspension incubated for 

25 30 min at room temperature. 

This material was iysed using a BEAD-BEATER™ containing an equal 
volume of 0.1-0.15 mm glass beads. The lysis was done for 6 min at room 
temperature in 1 min intervals. The liquid was separated from the beads and 
cemrifuged for US hr at 20,000 xg. The supernatant was removed and the pellet was 
resuspended in 100 ml 8 M urea, 20 mM Tris-Cl pH 8.0, 5 mM DTT, 15 mM /5- 
mercaptoethanol, 5 mM PMSF, and 1 mM KEDTA The pellet was solubilized using 
a por/tron homogenizer and centrifuged at 20,000 xg for 2 hr. 

The supernate (90 ml) was diarysed against 4 liters of 8 M urea, 20 mM 
HEPES, pH 63, 1 mM EDTA, and 15 mM /5-mercapioeLtaanol. Dialysis was done 
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each time for 8 hr or longer with three changes of buffer. Spectraphor dialysis tubing 
with a 3.5 Kd MW cut-off was used. 
3. CM Chromatography; The dialysate was loaded onto a 100 ml column (2.5 cm x 20 
cm) packed with CM FAST FLOW SEPHAROSE™ equilibrated in 8 M urea, 10 
5 mM HEPES pH 6^5, 15 raM 0-mercaptoethanol, and 1 mM Na EDTA at room 

temperature. The column was washed with 200 mi equilibration buffer, and the 
protein eluted with a 1.0 liter linear gradient from 0-0.4 M Nad The HIV protein 
(26 Kd) eluted at approximately 0.2 M NaCl as assayed by SDS-potyacrylamide gel 
electrophoresis. 

10 Further purification was obtained by pooling PBlwMjs-containing fractions 

and applying to an S-300 gel filtration column equilibrated in the same buffer as the 
previous column. 

Example 16 - Cell fusion inhibition 
15 The in vitro fusion of HTV infected cells with T4 positive T cells is measured in the presence 

and absence of immune serum. This is a well known assay (Putney et al. [1986] Science 234:1392- 
1395). 

Chronically infected cells and uninfected cells (1:15) are mixed and incubated 24 hr. Foci of 
fused cells (giant cells) are then counted (usually about 60). Dilution of an immune serum, for 
20 example, serum to the entire HTV envelope (gpl60) or to a protein or peptide of the invention, is 

added when the cells are mixed. A 90% decrease in giant cells after 24 hr indicates the immune 
serum can block fusion. This assay can be done with cells infected with various virus strains, for 
example, HTV B and HTV RF . 



25 Example 17 - Competition cell fusion 

Using the assay described in Example 16, one can determine if proteins or peptides contain 
the epitope recognized by antibodies that are responsible for cell fusion inhibition. For example, 
fusion inhibition of HrV I]IB infected cells by antiserum to the PB1-III B protein of the parent 
application is abated by addition of PB1-III B protein to 5 fig/ml Using antiserum to PBM1I B and 

30 adding any one of the proteins or peptides, for example, Sub 2, Sub 1, CNBrl, peptide 135 or peptide 

136 at 5 /tg/ral totally blocks the activity of the PB1 antiserum. Additionally, antiserum to PB1-RF 
that is capable of neutralizing HIV-RF can be blocked in this activity by peptide 139. A peptide 
containing only the central portion of the peptide 139, eg., peptide 339, also can block the fusion 
inhibition activity of antiserum to PB1-RF. This, for the first time, localizes the critical amino acids 
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necessary to elicit neutralization or block fusion inhibiting antibody to a ten amino acid sequence (eg., 
peptide 339). 

Example 18 - Co-immunization of PBl-HIn and PBlo F 
5 Antisera from an animal immunized with two PB1 proteins from HTV IUB and HlVjyr isolates 

were capable of blocking cell fusion of both HTVrmi- and HIVR^infected cells. This demonstrates 
that co-immunization with separate proteins containing envelope sequences of two HTV isolates elicits 
an immune response capable of neutralizing both isolates. This novel property of small proteins or 
peptides blocking immune serum has not been described before. 

10 Some of the proteins and peptides of the subject invention contain the entire epitope for 

raising humoral immune responses that neutralize HTV infection and block HIV infected cell fusion. 
This is shown by these proteins and peptides competing these activities out of serum from animals 
immunized with the entire HTV envelope. More specifically, proteins and peptides that can compete 
the activities from anti-gp!60 or anti-PBl sera are Sub 2, Sub 1, CNBrl, peptide 135, and peptide 136. 

15 The proteins and peptides of the invention also can be used to stimulate a lymphocyte 

proliferative response in HIV infected humans. This then would stimulate the immune system to 
respond to HIV in such individuals. 

Example 19 - Construction of and expression from plasmid pPB1 l lre 
20 Plasmid pPBl, which contains approximately 540 bp of DNA encoding essentially the HIV cny 

gene from the PyuII site to the Bgi n site, and from which is synthesized an approximately 26 Kd 
fusion protein containing this portion of the gpl20 envelope protein, can be constructed as follows: 

1. Synthesizing the DNA with the sequence shown in Table 8: This DNA fragment can 
be synthesized by standard methods and encodes a portion of gpl20. It has a blunt 

25 end on the 5' end and an end which will ligate with a Bam HI overhang on the 3' end. 

2. Restricting 5 fig plasmid pREV2.2 with EcoRV and BamHI and isolating the large 
fragment, approximately 4 Kd, from an agarose geL 

3. Ligating 0.1 fig of the fragment in Table 8 with 0.1 fig of the pREVZ2 fragment in 
a volume of 20 ftl using T4 DNA ligase, transforming the ligation mixture into 

30 competent cell strain SO20251, and selecting ampicillin-resistani transformants. 

4. Using the AhaTIl restriction pattern of purified plasmid, selecting cells harboring the 
recombinant plasmid with the synthesized fragment in the orientation whereby the 
fragment blunt end ligated to the REV2.2 EcoRV end and the BamHI overhanging 
ends ligated together. Ahalll digestion of the proper plasmid gives fragment lengths 

35 of approximately 1210, 1020, 750, 690, 500, 340, and 20 base pairs. When the strain 
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harboring this recombinant plasmid is grown in 2% medium containing 50 /*g/ml 
ampicillin and the total complement of cellular proteins electrophoresed on an SDS- 
polyacrylamide gel, a protein of approximately 26 Kd can be visualized by either 
coo mass ie blue staining or by Western blot analysis using as probe selected sera from 
5 AIDS, ARC, or HIV infected individuals. 



Example 20 — Purification of recombinant protein containing HTV envelope sequences from plasmid 

PPBlnro 

1. Growth of cells: Cells were grown in a 10-liter volume in a Chemap fermentor in 2% 

10 medium. Fermentation temperature was 37°C, the pH was 6.8, and air was provided 

at 1 win. Plasmid selection was provided by 50 |<g/ml ampicillin and 20 /*g/ml 
chloramphenicol. Typical cell yield (wet weight) was 30 g/I. 
Z Cell lysis: 50 g, wet cell weight, of E. poli containing the recombinant HIV envelope 
fusion protein were resuspended in a final volume of 100 ml in 50 mM Tris-Cl pH 

15 8.0, 5 mM KEDTA, 5 mM DTT, 15 mM ^-mercaptoethanol, 0.5% TRITON™X- 

100, and 5 mM PMSF. 300 mg lysozyme was added and the suspension incubated for 
30 min at room temperature. 

This material was lysed using a* BEAD-BEATER™ (Biospec Products, 
Bartlesville, OK) containing an equal volume of a 1-0.15 mm glass beads. The lysis 

20 was done for 6 min at room temperature in 1 min intervals. The liquid was separated 

from the beads and centrifuged for 2.5 hr at 20,000 xg. The supernatant was removed 
and the pellet was resuspended in 100 ml 6 M guanidine-hydrochloride, 20 mM Tris- 
Cl pH 80, 5 mM DTT, 15 mM ^-mercaptocthanol, 5 mM PMSF, and 1 mM KEDTA 
The pellet was solubilized using a polytron bomogenizer and centrifuged at 20,000 

25 xg for 2 hr. 

The supemate (90 ml) was dialysed against 4 liters of 8 M urea, 20 mM Tris- 
Cl, pH 8.0, 1 mM EDTA, and 15 mM 0-mercaptoethanol. Dialysis was done each 
time for 8 hr or longer with three changes of buffer. Spectraphor dialysis tubing (S/P, 
McGraw Park, IL) with a 3.5 Kd MW cut-off was used. 

30 3. CM Chromatography: The dialysate was loaded onto a 550 ml column (5 an x 28 

cm) packed with CM FAST FLOW SEPHAROSE™ equilibrated in 8 M urea, 10 
mM potassium phosphate pH 7.0, 15 mM 0-mercaptoethanol, and 1 mM KEDTA 
at room temperature. The column was washed with 2 liters equilibration buffer, and 
the protein eluted with a 5.0 liter linear gradient from 0-0.4 M NaCt. The HIV 
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protein (26 Kd) eluted at approximately 02 M NaCl as assayed by SDS-polyacryiamide 
gel electrophoresis. 

Example 21 - Immunization with two or more peptides to obtain broadly neutralizing antisera 
5 Five peptides, it, peptide 135, peptide 139, peptide 141, peptide 142 and peptide 143, were 

cross-linked individually to carrier proteins and used to immunize goats. Each peptide is capable of 
eliciting type specific neutralization when used individually as an immunogen. Synthetic peptides were 
cross-linked through a sulfhydryl bond to keyhole limpet nemocvanin (KLH) by using Succinimidyl- 
4-(n-Maleimidomethyl)Cyclohexane 1-Carboxylate (Pierce). The ratio of peptide to KLH was 12 by 

10 weight. 200 fig of each cross-linked peptide was used in the immunization cocktail (a total of 1 mg 

of 5 peptides, 2 mg of KLH). This method of crosslinldng or immunization regimen is but an example 
and not meant to be limiting. After four immunizations, immune sera was tested for neutralization 
of these five HIV isolates as well as distinctly different isolates. The immune serum could block 
fusion of cells infected with any of five isolates from which the peptide sequences were derived. In 

15 addition, the serum neutralized other variants not used in the immunization. 

Equivalent broad neutralizing sera may also be obtained by variations of this immunogen. For 
example, using more than five peptides having the amino add sequence derived from the principal 
neutralizing domain from more than five variants. Alternatively, a single peptide (e.g., peptide 64 or 
peptide 74) containing segments homologous to diverse HIV variants may also be used to elicit broad 

20 neutralizing antibody. 

Example 22 - Sequential Immuniz ation with Two or More Peptides as a Method to Elicit Broad 
Neutralizing Antisera 

An immunization protocol capable of elidting broad neutralizing antibodies may take the form 
25 of initial immunization with a peptide or protein antigenkalry equivalent to the prindpal neutralizing 

domain, or segments thereof. The initial immunization is followed with a second immunization. The 
initial immunization could be done with, for example, peptide 135, peptide 139, peptide 141, peptide 
142, or peptide 143, with subsequent immunization with, for example, one or more of the following 
peptides: 
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The method is to immunize with a protein or peptide and then boost the immune response 
to a defined subset of the original immunogen. This immunization method may be useful in vaccine 
methodology and also to generate broad neutralizing polyclonal or monoclonal antibodies for 
5 therapeutic applications. 

Example 23 - Identification of Critical Segments of the Principal Neutralizine Domain 

Certain segments of the principal neutralizing domain have been found to be capable of 
eliciting the antigenic and immunogenic responses which are associated with the principal neutralizing 
domain as a whole. For example, a region of the principal neutralizing domain known as the "tip of 
the loop" has been shown to be capable of raising, and/or binding with, neutralizing antibodies. This 
capability is observed for the 'tip of the loop" of a variety of HIV variants. 

The tip of the loop comprises a three amino acid segment which is highly conserved between 
HIV variants, together with various amino acids which occur on either side of the three conserved 
amino acids. The three conserved amino acids, which are Gly Pro Gry, usually occur at, or about, 
positions 311, 312, and 313 of the HIV envelope protein. 

The "tip of the loop* comprises the Gly Pro Gly segment together with the 2 to 8 amino acids 
which flant either or both* sides" of this" segment in any given HTV variant. The amino acids which 
flank the conserved segment may be any of the 20 natural amino adds, in any sequential order. 
Although the amino acid sequence of the principal neutralizing domain varies between different HIV- 
1 isolates, conservation at particular positions, for example at the tip of the loop, suggests (hat certain 
amino acids at these positions are necessary for virus function. 

Example 24 - Sequence of the Prin cipal Neutralizing Determinant from Randomly Selected HlV-l 
25 Isolates 

Sequences of the principal neutralizing domains (PNDs) from random field isolates were 
obtained in order to determine the degree of heterogeneity within this region of the envelope protein. 
Peripheral blood lymphocytes (PBLs) from randomly selected HTV-1 infected donors were cither co- 
cultured with uninfected PBLs or the virus isolates were adapted to CD4 cell lines. DNA was 

30 extracted from these infected cells and a 240 base pair region encoding the PND was amplified by 

polymerase chain reaction using oligonucleotide primers that hybridize with flanking conserved regions. 
This product was cloned into pUC19 and the sequence of the PND from one or more clones from 
each original isolate were determined. Because of the heterogeneity of the virus population wilhin 
one infected individual, when two or more sequences were obtained from one PGR reaction, these 

35 sequences sometimes differed. 
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The data obtained from nearly 100 individuals (some infected with a heterogeneous virus 
population) was evaluated along with previously obtained HIV sequence information. Table 9 lists 
138 PND sequences from HTV isolates. These sequences indicate that, despite the well-known and 
frequently cited variability in the amino add sequence of the HIV envelope protein, there is actually 
5 a high degree of conservation in the immunologically critical PND region, particularly in the region 

at the center of the PND. Specifically, the Ory-Pro-Gry sequence at the tip of the loop' occurs in 
over 90% of the variants. Furthermore, other amino adds at certain positions on each side of the G- 
P-G were also found to be highly conserved. Table 10 shows the frequency of occurrence of the 
various amino adds at each position in the PND. In addition to the very strong conservation of the 
10 glycines flanking the central proline, there is strong conservation at several other positions (eg, R at 

x 12 , P at x n , G at y n , R at y 14 , and A at y 16 ). 

A comparison of the relative frequency of variations of a n-amino add segment centered 
about the G-P-G sequence is shown in Table 11. In this table, the sequence which reflects the most 
commonly occurring amino adds at each position is listed first The dashes indicate identity with the 
15 consensus sequence. The remaining sequences are ordered from 2 to 138, according to their homology 

to the consensus sequence. Thus, the sequences at the top of the table display the greatest homology 
with the consensus sequence. ^Sequences far down the table display less homology. For example, the 
amino' add sequences from isolates nm and LAV-BRU occur at positions 92 and 93, respectively, on 
this table. This indicates that these isolates have only limited homology with the consensus sequence. 
The present research shows that HTV viruses such as mB and LAV-BRU having the Gln-Arg (Q-R) 
dipeptide to the left of the Gly-Pro-Gly sequence are relatively uncommon. By contrast, the MN- 
like sequence in this region (JHIOP G.„) is the most common. The present research shows 
that principal neutralizing domains of other commonly studied variants comprise relatively uncommon 
sequences. 

Although the subject invention pertains to the discovery of certain highly conserved regions 
in the prindpal neutralizing domain, there remains some degree of variability in this region among 
the various isolates. This variability indudes -missing- or -added' amino adds at certain points in the 
sequence. Of course, -missing- or 'added' amino adds can cause difficulty in devising aesthetically 
pleasing tables showing the sequences. However, these missing or added amino adds pose no difficulty 
to a person skilled in the art in terms of locating the highly conserved regions which are critical to 
the subject invention. Table 9 shows one representation of 138 PND sequences. Table 11 uses a 
slightly different representation to show the same PND sequences. The primary discussion of sequence 
conservation can probably best be visualized by reference to the representation shown in Table 11. 
However, it should be noted that the existence of more than one means for representing these 
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sequences does not compromise the ability of the skilled artisan to accurately locale the sequences 
or the conserved regions. 

With the discovery of commonly occurring amino acid sequences, it is possible, for the first 
time, to develop prophylactic and therapeutic compositions which can predictably elicit and/or bind 
5 with neutralizing antibodies to a broad range of HTV variants. The generalized formula for such a 

composition can be as follows: 

aiGiGyb 

wherein x is 0 to 13 amino acids in length; 

y is 0 to 17 amino acids in length; and 
10 z is P f A, S, Q, or L; and 

either a or b, but not both, may be omitted; either a or b individually may comprise any one of the 
following: cysteine, a protein or other moiety capable of enhancing immunogenicity, a peptide from 
an HIV principal neutralizing domain, a peptide capable of stimulating T-cells, or a general immune 
stimulant 

15 Further analysis of common sequence patterns reveals certain specific patterns which are very 

common among HTV isolates. Examples of these common sequences are shown in Table 12 and 
Figure L These common patterns, which are known only as a result of the research and discoveries 
described here, can be used to make pharmaceutical and diagnostic composi lions which can be used 
with a broad range of HTV isolates. This, of course, can be of critical importance, given the large 
20 number of HTV variants which are now known to exist. 

Table 12 is a compilation of the common sequence patterns that occur in the region at ihe 
tip of the loop. For example, approximately 60% of the HIV isolates contain the core sequence I a 
I G P G R (a represents several different residues), approximately 50% contain the sequence 1 G P 
G R A, and approximately 40% contain G P G R A F. When a His residue is present at the a 
25 position inlalGPGR, this sequence occurs in approximately 30% of the HTV isolates. A vaccine 

composition comprising a mixture of peptides having the sequence I a I O P G R where all of the 
possible replacements for the a are present, is capable of eliciting antibodies which neutralizes a 
majority of HTV* variants. Preferably, for use as immunogens, the peptides are linked to carrier 
proteins or adjuvants as described in Example 21. 

As shown in Figure 1, common sequence patterns are also apparent within the 17 amino acid 
segment. Sequences which were isolated 4 or more times are highlighted. These commonly occurring 
sequences can be used to formulate vaccine cocktails which elicit a broadry neutralizing response. For 
example, a potential cocktail might contain peptides from each of the eight groups represented. 
Alternatively, the peptide sequences may be presented as a hybrid polypeptide containing the principal 
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neutralizing domain from two or mare of these groups. Preferably, such a cocktail will contain 
peptides which will be capable of raising antibodies which neutralize at least 70% and most preferably 
at least 90% of HIV variants. 

The antigens of the subject invention can be Identified by their ability to raise antibodies 
5 which bind to certain amino acid sequences. For example, particularly advantageous antigenic 

compounds would raise antibodies which bind to common amino acid sequences such as G-P-G-R- 
A-F, I-G-P-G-R-A-F, I-G-P-G-R-A, I-a-I-G-P-G-R, I-a-I-G-P-Q-R-A, and I-«-l-G-P-G-R-A-F, where 
a is any of the 20 amino adds. 

From Table 10 it can be seen that a polypeptide representing the occurrence of amino acids 
10 in all of the variants can be represented as follows: 

*13 x 12 X n X 10 it, Xg X 7 X6 X 5 X4 3t 3 X2 »1 O * G y 2 V3 y 4 y 5 y 6 y 7 y 8 y, y™ y n y 12 y 13 y u y 13 y 16 y, 7( 
wherein 

15 X] is I, R, M, 1QR, V,L,K,F, S, G, Y, SRG, or YQR; 

x 2 is H, R, Y, T, S, P, F, N, A, K, G, or V; 

x 3 is I, L, M, T, V, E, G, F, or Y; 

X4 is R, S, G, H, A, K, or not present; 

*5 is K, R, 1, N, Q, A, IR, RQ, or not present; 
20 is R, K, S, I, P. Q. E, G, or T; 

x 7 is T, K, V, I, A, R, P, or E; 

xg is N, NV, Y, KJ, I, T, DK, H» or K; 

X9 is N, S, K, E, Y, D, I» or Q; 

x 10 is N, Y, S, D, G, or H; 
25 x„ is P; 

x 12 Is R, 1, or K; 

x 13 is T ( I, M or A; 

1 is P, A, Q t S, or L; 

y, is R, K, Q, G, S, or T; 
30 y 2 is A, V, N, R, K» T, S, F, P, or W; 

y 3 is F, I, V, L, W, Y, G, S, or T; 

y 4 is Y, V, H, L, F, S, 1, T. M, R, VH, or FT; 

y 5 is T, A, V, Q, H, I, S, Y, or not present; 

y 6 is T, R, I, Q, A M, or not present; 
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y-j is G, E, K, R, T, D, Q, A, H, N, P, or not present; 
y 8 is R, Q, E, K, D, N, A, O, S, I, or not present; 
y 9 is I, V, R, N, G, or not present; 
y 10 is I, T, V, K, M, R, L, S, E, Q, A, or not present; 
5 y n is G, R, E, K, H, or not present; 

y 12 is D, N, I, R, T, S, or not present; 
y 13 is I, M, ME, L, or not present; 
y 14 is R, G, K, S, E» or not present; 
yi5 is Q, K, or R; 
10 y 16 is A; and 

y 17 is H, Y, R, or Q. 

Monoclonal and/or polyclonal antibodies with broad neutralizing activity can be generated 
using the commonly occurring peptide sequences for use in prophylactic or therapeutic compositions. 
15 The commonly occurring sequences described here can be used in much the same way as the other 

peptides described in this application. For example, these peptides can be modified in order to 
provide T-h/mphocyte stimulation, general immune stimulation, to enhance immunogen icily or 
solubility, or to reduce toxicity. The peptides may also be modified by addition of terminal cysteine 
residue(s) or by conjugation to a carrier protein, adjuvant, spacer, and/or linker. The peptides may 
20 be fused with other HIV epitopes to produce a multiepitope polypeptide which could be useful with 

an even greater number of HIV variants. Also, the peptides can be circularized by bonding between 
cysteine residues. The cysteine residues used to make such circularized peptides could be the naturally 
occurring cysteine residues at the ends of the principal neutralizing domain, or cysteine residues may 
be added to the terminal ends of the peptides. 

Additionally, vaccine compositions may include peptides containing T helper cell epitopes in 
combination with protein fragments containing the principal neutralizing domain. Several of these 
epitopes have been mapped within the HIV envelope, and these regions have been shown to stimulate 
proliferation and rymphokine release from lymphocytes. Providing both of these epitopes in a vaccine 
composition may result in the stimulation of both humoral and cellular immune responses. 
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Example 25 - Constru ction and Clonine of Multi-Epitope Gcp es 

Synthetic genes can be constructed which encode proteins comprised of the neutralizing 
epitopes from more than one HIV isolate. The synthetic gene exemplified here comprises a tandem 
arrangement of DNA sequences encoding neutralizing epitopes from HIV isolates IIIB, RF, SC, MN, 
and WMJ1. Each epi tope-encoding domain within the gene was designed to encode the 11 amino 
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acids centered at the common Gly-Pro-Gry sequence at the tip-of-the-loop tor each of the isolates. 
Thus, the multi-epitope gene contained 5 different coding regions, each of which encoded a 
neutralizing epitope from a different isolate. For this particular construction, the epitope which was 
chosen for each of the 5 isolates consisted of the Gly-Pro-Gry sequence along with the 4 amino acids 
5 on either side of the Gly-Pro-Gly sequence from each of the 5 isolates. Domains coding for other 

neutralizing epitopes from these isolates could have been incorporated into the multi-epitope gene. 
Also, genes coding for neutralizing epitopes from other isolates can be used. 

The genes were constructed such that the domains were linked by DNA sequences encoding 
four glycine residues. The composition or length of the linking sequence can be varied but preferably 
10 it is a sequence that is non-immunogenic itself. The DNA sequence of the synthetic gene described 

here was designed such that restriction sites were encoded at either end of the fragment to facilitate 
cloning into the vector or, alternatively, to permit the construction of longer multi-epitope genes by 
attachment of 2 or more shorter genes (Figure 2). In addition, a methionine residue was encoded at 
the 5* end of the gene to facilitate cleavage when produced as part of a fusion protein. 

Figure 3 depicts the steps in the construction of the multi-epitope gene described here. The 
amino acid sequence encoded by this gene is shown in Table 13. The portions of this amino add 
sequence which correspond to each of the 5 isolates are identified in Table 13. 

Double-stranded subfragments of the full-length gene were first constructed starting with 
single-stranded synthetic oligomers designed to encode tandem neutralizing epitopes and linking amino 
acid sequences. Any number of subfragments can be used. In this experiment the gene was divided 
into two portions, but three, four, or more portions can be used. Four single-stranded oligomers of 
between 67 and 78 nucleotides in length were synthesized (HEO-1, HEO-2, HEO-3, and HEO-4) 
(Figure 3). The oligomers were designed in pairs (HEO-1 and 2; HEO-3 and 4) as opposite and 
adjacent strands of the double-stranded subfragments having 10 (HEO-1 + 2) or 11 (HEO-3 +4) bases 
of complementary overlap. The oligomen of each pair were mired and heated 65*C for 5 minutes, 
then incubated at 37*C for 1 hour to anneal. 

After annealing, the complementary strands of each pair were completed ffiUed-in") using 
Sequenase (U.S. Biochem) and the four deoxynudeotidc triphosphates. This reaction was incubated 
for 1 hour at room temperature, heated at 65 *C for 3 minutes, and then incubated for an additional 
hour at 37'C with fresh Sequenase. Double-stranded fragments of 141 (HEO-1 +2) and 126 (HEO- 
3+4) base pairs were generated representing adjacent subfragments of the mulii-epitope gene. HEO- 
1+2 comprised the coding sequences for 3 epitopes plus adjacent linker amino acids; HEO-3+4 
extended from the fourth epitope to the end of the gene. 

Following the fill-in reaction, the samples were extracted with phenol/chloroform and 
precipitated with ethanol by standard procedures. The resulting double-stranded DNAs were digested 
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with Hindm (HEO-1+2) or SacI (HEO-3+4) and purified on a 3% NuSicvc agarose gel. The 
purified fragments were ligated with Hindm + SacI digested pUC19 (New England Biolabs) in a 3- 
component ligation and transformed into col| JM10S cells. The presence of a 256 base pair 
fragment in pUC19, encoding the full-length multi-epitope gene, was confiinned by restriction analysis 
5 and DNA sequencing. The resulting plasmid was designated pUC/MEP-1. 

The MEP-1 insert was removed from pUC/MEP-1 and recloned into Hindlll + Sac I digested 
pRevZl for high-level expression of a fusion protein comprised of a leader portion from the £. coji 
BG gene fused to the multi-epitope protein. The resulting plasmid, designated pMEP-1-8342, was 
transformed into £ coji strain SG20251 and the 12.9 Kd multi-epitope fusion protein was identified 

10 by coomassie blue staining or Western blot analysis using a probe selected from antisera to the loop- 

tip peptides from each of the 5 HIV isolates. The fusion protein can be used intact or, alternatively, 
the leader portion can be cleaved off by cyanogen bromide which cleaves on the carboxy-terminal side 
of methionine residues. The amino acid sequence of the fusion protein is shown in Table 13A. 

The multicpitope peptide can be purified from recombinant cells by methods described above. 

IS Other synthetic genes can be constructed which encode tandem neutralizing epitopes from any 

number of different HIV isolates using the procedure described above. In addition, variations on the 
above procedure can be made which are meant to be included in the present invention. For example, 
the lengths of the neutralizing epitopes encoded by a gene can vary, and there can be variation in the 
length of the individual epitopes within a single gene. Further, the number of neutralizing epitopes 

20 within a multi-epitope gene can vary, and the composition or the length of the amino acid sequences 

of the epitopes or the linking sequences can be varied from the example that is described herein. 

It should be understood that the examples and embodiments described herein are for 
illustrative purposes only and that various modifications or changes in light thereof will be suggested 
25 to persons skilled in the an and are to be included within the spirit and purview of this application 

and the scope of the appended claims. 

This work was supported in part under contract number N01-A1-62558, awarded to Repligen 
Corporation by the National Institute of Allergy and Infectious Diseases (NIAID). 
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TABLE 2 

LBuA*r.GlnSepV.lGluIl«ftsr 1 C«»ThrftraPrDA.nA.nft«r.ThrAr«L«. 
S.pIl.Ar«Il.BlnAP«01wPro01«Ar-Al.Ph.V.lThrIl-01iiL«.IX. 
Bl»A.nM.tAr-61nAl.Hi.C«.A.r.H.B.PAr«Al.Lw.Trr».nA.nThr 

TABLE 2A 

CTGAACCAATCTGTAGAAATTAATTBTACAABACCCAACAACAATACAAOAAAA 
AGTATCCGTATCCAGAGAGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATA 
OMAATATGABACAAGCACATTSTAACATTABTMABCAAAATGBAATAACACTTT 

TABLE 2B 

N .tL.uAT«ProV.lBl U 7hrPToThrAp-BluIleL«.LwmL.uA.FGl»U.o 
Tr^Al.Ph.B.rL.uA,PAr.BluAr*W.lAl.A.pL. U ».nBlnS.rW.161u 
Il.AsnC««-rhpAr-ProA.nA.nA.nThrAr«L».BtPll.Ar«Il.BlnAp« 

Gl,,ProBl-Ar*Al.Ph.«tlThni.Bl«Lw.n.Bl«A.nM«tAr«Glr,Al. 
Hi.Cv.A.r.:i.B.pAr-Al.LwTrPA,nA.nThPL.uBl-Al.A«Il.L e u 

GluAsvGIuArSAlsSer 

TABLE 2C 

^TGTTA^GTnCTGTAGAAACCCCAA-CGTGAAATCAAAAAACTGGA-GG^TG 
TGGGCATTCAG~ CT G G ATCG CG A ACGCGTGGCCG ATCTG A ACC AATC T G7 AG A A 
ATTAATTGTAC a AG ACC C A AC A AC A ATAC A AG A A A AAGT ATCCGT AT CC AG AG A 
GG AC C AGGG AGhGCATTTGTT AC AAT AGG AAAA AT AGGAA AT ATG AG AC AAGC A 
CATTGT A AC ATT AGT AG AGC A A AATGGAAT AACACTTTGGGADCTCG AATTCTT 

GAAuACGAmAGGGCCTCG 
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TABLE 3 

LeuAsriGlnSerValBluIleAsnCMsThrArflProAsnAsnAsnThrAraLys 
SerlleAralleGlriAraGlsProDlyArsAlBpheUslThrlleGluLysIle 
GlyAsnMetAr^GInAlaHisCasAsnlleSerArsAlBLysTrpAsnAsnThr 
LeuLysGlnlleAspSerLysLeuAr^GluGlnPheGlyAsnAsnLysThrlle 
IlePheLysGlnSerBerGlyGlyAspProGluIleValThrHisSerF'heAsn 
CysGlyGlyGluPhePheTyrCwBAsnSerThrGlnLeuPheAsnSer 

TABLE 3A 

CTGAACCAATCTGTAGAAATTAATTGTACAAGACCCAAC AACAAT ACAAGAAA A 
AGTATCCGTATCCABAGAGGACCA6BGAGAGCATTTGTTACAATAGGAAAAATA 
GGAAATATGAGACAAGCACATTGTAACATTAGTAG AGCAAAATGGAATAACACT 
TTAAAACAGATABATABCAAATTAABABAACAATTTBBAAATAATAAAACAATA 
ATCTTTAAGCAGTCCTCAGGAGGGGACCCAGAAftTTGTAACGCACAGTTTTAAT 
TGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT 
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TABLE 3B 

KetLeuArsProVslGluThrProThrAraGluIleLasLysLeuAspGlyLeu 
TrpAlaPheSerLeuAspArdGluAraVelAlsAspLeuAsnGlnSerVslGlu 
HeAsnCwsThrArdProAsnA*nAsnThrAr3L«sSerIleArSlleGlnArfl 
GlyProGlyArdAlsPheValThrlleGlyLusIleGlyAsnhBtArsGlnAls 
HisCysAsnlleSerAraAlaLysTrpAsnAsr.ThrLeuLysGlnlleAspBer 
LysLeuArdGluGlnPheGlwAsnAsnLvsThrllellePheLtfsGlnSerSer 
GlyGlwA5PProGluIleyalThrHisSerPhe.AsnCBsGl»GlaGluPhePhe 
TyrCysAsnSerThrGlnLeuPheAsnSerGlySerSerAsnSer 

TABLE 3C 

ATGTTACGTCCTGTAGAAACCCCAACCCGTGAAATCAAAAAACTGGACGGCCTG 
T6GGCATTCAGTCTGGATCGCGAACGCGTGGCCGATCTGAACCAATCTGTAGAA 
AXT AATT6T AC A AGAC-C AACAACAATACAAGAA AAAGT ATCCGT ATCC AG AG A 
GG ACC AGGGAGAGCAT7TGTTACAAT AGGA AAA AT AGGAA AT ATG AGACAAGC A 
CATTGTAACf-TTAGTADAGCAAAATGGAATAACACTTTAAAACAGATAGATAGC 
AA ATT AAGAGAACAATTT GGAAAT AATAAAAC A AT AATCTTTAAGCAGTCCT C A 
GGAGGGGAwCCAGAAATTGTAACGCADAGTTTTAATTGTGGAGGGGAATTTTTC 
TACTGTAATTCAACACAACTGTTTAATAGTGGGAGCTCGAATTCT 
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TABLE 4 



LsuAsr.AlaSerValGlraieAsnCysThrAr-ProAsnAsnAsnThrAraLy* 
SerlleThrLasGlyProGlyArsVaineTyrAlaThrGlyGlnlltlleGly 
AspIleAr-LysAlsHisCysAsnLeuSerArSAl aGlnT r*A»nAsnThrLeu 
LasGlriValValThrLasLeuArSGluGlriF'heAspA5nL«sThrIl«UslPhe 
ThrSerSerSe rG ly G 1 *AspF' roGlu 1 1 e Va lLeuHi s5e rPhe AsnCy *G 1 w 
GlwGluPhePheTyrCasARnThrThrGlrtLeuPheAsnSerThrTrf-AsnSer 
ThrGluGlySerAsnAsnThrGlyGl* AsnAs*ThrIleThrLeuProCwsAra 
UeLwsGlnllBUalAsnMetTrpGlriGluUalGlaLwsAlaMetTwrAlaF'ro 
PrcIleSerGlwGlrilleLwsCwsIlaSepAsr.1 1 eThrGl wLeuLeuLeuThr 
ArflAspGl yGl yGluAspTh rThrAsnThrTh r 



TABLE 4A 



CTGAATGCATCTGTACAA ATT AATTGTACAAGACCCAAC A AC AATACAAGAA AA 
G AC7TACGTAGAC ATGTTT AATT AAC ATGTTCT6GGTTGTTGTTATGTTCTTTT 

AGT AT AACTAAGGG ACC AGGGAGAGT AATTT ATGC AfiCAGGACA AAT AAT AGG A 
TCATATTGATTCCCTGGTCCCTCTCATTAAATACGTTGTCCTGTTTATTATCCT 

GATATAAGAAAAGCACATTGTAACCTTAGTaGaGCACAATGGAATAACACTTTA 

ctatattcttttcgtgtaacattggaatcatctcgtgttaccttattgtgaaat 

aa ac aggtagtt aca aa att aagagaac aat7tg ac a at aaaacaat agtcttt 
tttgtccatcaatgtttt^attctcttgttaaactgttattttgttatcagaaa 

acgtcatcctcaggaggggacccagaaattgtacttcacagttttaattgtgga 
trcagtaggagtcctcccctgggtctttaacatgaagtgtcaaaattaacacct 

ggggaatttttctactgtaatacaacacaactgtttaat agtacttggaatagt 
ccccttaaaaagatgacattatgttgtgttgacaaattatcatgaaccttatca 

ftctdaagggtcaaataacactdgttggaaatcac ac aatc acactccc atgcad a 
TGACTTCCCAGTTTATTGTGACCTCCTTT ACTGTGTT agtgtgagggtacgtct 

ftTAAAACAAATTGTAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCT 
TATTTTGTTTAACATTTGTACACCGTCCTTCATCCTTTTCGTTACATACGGGGA 

CCCATCAGTGGACAAATT A A ATGT AT ATCA AAT ATT ACAGGGCT ACTAT7 AACA 
GGGT AGTCACCTGTT7 A ATTT AC AT AT AGT7T AT A ATGTCCCGATGATAATTGT 

AG AGATGGGGGTGAAG AT AC AACT A AT ACT AC AG A 
TCTCT ACCCCCACTT CT ATGTTG AT 7 ATGATGTCTC7 AG 
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TABLE 4B 

MetLeuArsProValGluThrProThrArsGluIleLusLysLeuAspGluLeu 
TrpAlsPheSerLeuAspAr^GluArSVal AlaAspLeuAsnAleSerValGIn 
IleAsnCysThrAraProAsnAsnAsnThrArSLysSerlleThrLysGlyPro 
GlyArSVallleTyrAlsThrGlyGlnllelleGlyAspIleArsLysAlBHis 
CysAsriLeuSerAraAlBGlnTrpAsnAsnThrLeuLysGlnVslVsiThrLys 
LeuArSGluGlnPheAspAsnLysThrlleVelPheThrSerSerSerGlyGlu 
AspProGluIleValLeuHisSerPheAsnCysGl yG 1 y G 1 uPhePheTy rCys 
AsnThrThrGlnLeuPheAsnSerThrTrpAsnSerThrGluGlySe rAsriAsn 
ThrGlyGlyAsnAspThrlleThrLeuProCysAr^I leLysGlr.I leUsl Asr. 
MetTrpGlnGluValGlyLysAlaMetTyrAlaProProIleSerGlyGlnlle 
LysCysIleSerAsnll eTh rGlyLeuLeuLeuTh r A r£ As pGl y G 1 y G i uAsp 
ThrThrAsnThrThrGluIleAraArSGlnAleSer ArSGl uLeuGluPheLeu 
Ly5ThrLy5GlyProArSAspThrProIlePheIleGly 
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TABLE 4C 

ATGTTACGTCCTGTAGAAACCCCAACCCGTGAAATCAAAAAACTG6ACGGCCT6 
TGGGCATTCAGTCTGGATCGCGAACGCGTGGCCGATCTBAATGCATCTGTACAA 
ATTAATTBTACAABACCCAACAACAATACAAGAAAAAGTATAACTAA66GACCA 
6GGABAGTAATTTATBCAACABGACAAATAATAGGAG AT ATAAGAAAAGCACAT 
TGT A ACCTT AGTAGAGC ACAAT6G AATAACACTTT AAAAC AGGT AGTT AC AAAA 
TT A AG AB AACAATTTGAC AATAAAACAATAGTCTTTACBTC ATCCTC AGGAGGG 
GACCCAGAAATTGTACTTCACAGTTTTAATTGTGGAGGGGAATTTTTCT ACTGT 
AATACAACACAACTGTTTAATAGTACTTGGAATAGTACTGAAGGGTCAAATAAC 
ACTGGABGAAATGACACAATCACACTCCCAT6CAGAATAAAACAAATTGTAAAC 
ATGTBGCABGAABTAGBAAAAGCAATGTATGCCCCTCCCATCAGTGGACAAATT 
AAAT6TATATCAAATATTACAGGGCTACTATTAACAABAGATBGGGGT6AAGAT 
ACAACT AATACTACAGAG ATCCGTCGACAAGCTTCCCGGGAGCTG GAATTCTTG 
AAGACGAAAGGGCCTCGTGATACTCCTATTTTTATAGGT 
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TABLE 5 

GluAsnPheThrAspAsnAlaLysThrllel 1 eU a lHi sLeuAsnGl u 
S» rUalGl nlleAsriCyftThrArSProAsriTyrAsnLysArsLvsArfll leHi s 
IleGlyProGlwArSAlaPheTvrThrThrLysAsnl lei 1*G1 wThrlleArs 
GlriAlaHisCysAsrilleSerArSAlaLwsTrfAsnAspThrLeuArSGlrillB 
VelSerLasLeuLysGluGlnPheLysAsnLwsThrlleUalPheAsriGlriSe r 
SerGlyGlaAspProGluIleUElMetHisSerPheAsnCfsGlyGluGluPhB 
PheTa rCasAsnThrSe rProLeuPheAsnSerThrCvsLu si leLy sGlrJ 1 e 
IleAsnMetTrpGlnGluValGlyLstAl aMetTwr A 1 aProProI 1 eGl uGl a 
GlnIleArsJCysSerSerAsrtIleThrGlyLeuL«uLeuThrArsAspGlyGly 
LysAspThrAspThrAsnAsF'Thr 

TABLE 5A 

GATCTGAAAATTTC ACAG AC AA7GCT AA AACC ATAATAGTACACCTG AATG AA 
ACTTTTAAAGTGTCTGT7ACGATTT75G7ATTATCATGTGGACTTACTT 

TCTGT AC AAATT AATTGT AC AAGACCCA ACT ACAAT AA A AGAA AAAGG ATAC AT 
riGACMTGTTTAATTAACATGTTCTGuGTTGATGTTATTTTCTTTTTCCTATGTA 

ATAGGACChGGGAGAGCATTTTATACaACAAAAAATATAATAGGAACTAT A AG A 

7atcctggtccctctcgtaaaa7atg7tg7t7tttatattatccttgatat7c7 

caagcacattgtaacattagtagagcaaaatggaatgacactttaagacagata 
gttcgtgt aacattgt aa? c atctcgttttacctt actgtgaaa7tctgtct at 

gttagcaaattaaaagaacaatttaagaataaaacaatagtctttaatcaatcc 
caatcgtttaatt77c77"taaa7tc7tat77tg7ta7cagaaattagttagg 

t c agg agggg ac cc ag aaa~tgt a atgc ac agt7tt a attgtgg a gggg a a77t 
agtcctcccctgggtc77" a ac att acdtgtcaaaatt aacacctcccc77 aaa 

ttct actgt a at a c at c a c c ac t g777 a at agt acttgc a a a at a a a ac a a att 

AAGmTGACATTATGTAuTGGTGACAmAT~ATCATGAACGTTTTATTTTGTTTAm 
ATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATTGAAGGA 

tatttgtacaccgtccttcatccttttcgttacatacggggagggtaacttcct 

CAAATT AGATGTTCATCAAATATTACAGGGCTACTATTAACAAGADATGGTGGT 
GTTTAATCTACAAGTAGTTTFiTAATGTCCCGATGATAATTGTTCTCTACCACCA 

AAGGACACGGACACGAACGACACCoA 
TTCCTGTGCCTGTGCTTGCTGTGGCTCTAG 
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TABLE 5E 

HetLeuArsProValGluThrProThrArSGluIleLysLysLeuAspGlwLeu 
TtpAI aPheSerLeuAspAraGluArSVal AIbAspI lei leAspGl ySerGl u 
AsnPheThrAspAsnAl BLysThrllelleValHisLeuAsnGluSerValGln 
IleAsnCysThrArgProAsnTyrAsnLysArsLysArglleHi sIleGlyPro 
GlyArsAlaPheTyrThrThrLasAsrillelleGlyThrl 1 eAraGl nAl sHi s 
Cu sAsnl leSerAraAl aLasT rpAsnAspTh ?LeuAr SG 1 n 1 1 eUs 1 Sb rLy s. 
LeuLysGluGlnF'heLysAsnLysThrl leVBlPheAsnGl nSe rSe rGl yGl y 
AspProGluI leValMetHisSerPheAsnCysDlyGl yGl uPhePheTa rCy e 
AsnThrSerProLeuPheAsnSe rThrCysLysI leLysElrJ lelleAsnMet 
TrpGlnGluValGlyLysAlaMetTyrAlaProProIleGluGlyGlnlleArsi 
CysSerBerAsnl leThrGlyLeuLeuLeuThrArsAsfGl yGl yLysAspThr 
AspThrAsnAspThrGluIleArflArdGlnAlzSerArdGluLeuGluPheLeu 
LysThrLysGlyProAraAspThrProIlePhel leGly 
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TABLE 5C 

ATGTTAC6TCCTGTAGAAACCCCAACCCGTGAAATCAAAAAACTGGACGGCCTG 
TGGDC ATTCAGTCTGGATCGCGAACGCGTGGCC6AT ATCATCGATGGATCTG A A 
AATTTCACAGACAATGCTAAAACC AT A AT AGTACACCTGAATGAATCTGT ACAA 
ATTAATTGTACAAGACCCAACTACAATAAAAGAAAAAGGATACATATAGGACCA 
GGG AGAGC ATTTTATACAACAAAAAAT AT A ATAGGAACTATAAGACAAGCACAT 
TGTAACATTAGTAGAGCAAAATGGAATGACACTTTAAGACAGATAGTTAGCAAA 
TTA AAAGAACAATTTAAGAATAAAACAATAGTCTTT AATCAATCCTCAGGAGGG 
GACCCAGAAATTGTAATGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGT 
AATACATCACCACTGTTTAATAGTACTTGCAAAATAAAACAAATTATAAACATG 
TGGCAGG AAGTAGGAAAAGCAATGT ATGCCCCTCCC ATTGA AGG ACAAATTAGA 
TGTTC ATCAAAT ATTACAGGGCTACT ATT AAC AAG AG ATGGTGGTAA6GACACG 
GACACGAACGACACCGAGATCCGTCGACAAGCTTCCCGGGAGCTGGAATTCTTG 
AAG ACGA A AGGGCCTCGTG AT ACTCCT ATTTTT AT AGGT 
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TABLE 6 

LeuLysGl uAlaYalGluI 1 eAsnCusThrAraProAsriAsnAsruhrThrArS 
Serl 1 eHi si leGlyProGlwArsAlsPheTurAlaThrGlw AbpI lei 1 eGly 
Asp I leArdGlnAlaHisCysAsnl leSe rArSAl aLysT rpAsnAsnThrLeu 
LysGlrilleVBlIleLysUeuArsAspGlnPheGluAsnLysThrllel UF'he 
Asr.ArdSerSerGlyGlyAsPProGluIleValMetHisSerPheAsnCwsGly 
GlyGluFhePheTyrCysAsnSepThrGlnLeuPheSerSerThrTrpAsnGly 
ThrGauGlySerAsnAsnThrGlaGlyAsnAsf-Thrl leThrLeuP roCysArS 
I leLwsGl ul lelleAErirtetTrpGlnGluUalGltfLwfiAlaMetTyrAl aPro 
ProIleLysGlyGlr.ValLvsCysSerSerAsnlleThrGl yLeuLeuLeuTh r 
ArsAspGlyGlyAsnSerLysAsnGlySerLysAsnThr 

TABLE 6A 

CT6AAAG A AGCTGT AG AAATT AATTGT AC AAGGCCCA ACA AC AAT ACAACA AGA 
GACTTTCTTCGACATCTTTAATTAACATGTTCCGGGTTGTTGTTATGTTGTTCT 

AET AT AC ATAT AGG ACC AGGG AGAGC ATTTT ATGC A AC AGG AGACAT A ATAGG A 
TCATATGTATATCCTGGTCCCTCTCGTAAAATACGTTGTCCTCTGTATTATCCT 

GaTATAAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATAACACTTTA 
CT AT ATTCTGTTCGTGT AACATTGT AATCATCTCGTTTT ADCTT A7TGTGAAAT 

AAACAGATAGTTATAAAATTAAGAGACCAATTTGAGAATAAAACAATAATCTTT 
TTTGTCTATCAAT ATTTT A ATTCTCTD GTT AAACTCTT ATTTTGT TATTAGAAA 

AATCGATCCTCAGGAGG AG ACCCAGAA ATTGT AATGCACAGTTTT AATTGTGG A 
TTAGCT AG GAGT CCTCCTCTGGGTCTTTAACATTACGTGTCAAAATT AACACCT 

GGGGA ATTTTTCT ACTGT AATTCAACAC AACTGTTT AGT AG" ACTTGG AATGGT 
CCCCTT aa AAAG ATGACATT AAGTTGTGTTGACAAATCATCATGAACCTT ACCA 

ACTGAAGGGTCAAATAACACTGGAGGAAATGACACAATCAC^TCCCATGCASA 
TGACTTCCCAGTTT ATTGT GACCTCCTTT ACTGTGTT AGTGGGAGGG . ACGTCT 

ATAAAAGAAATTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCT 
ThTTTTCTTTAATATTTGTACACCGTCCTTCATCCTTTTCGTTACATA3GGGuA 

cccatcaaaggacaagttaaatgttcatcaaatattacagggctgctattaaca 
gggtagtttcctgttcaatttacaagtagtttataatgtcccgacgataattgt 

agagatggtggtaat agcaagaatggtagcaagaatacaga 
tctctaccaccattatcgttcttaccatcgttcttatgtctctag 
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TABLE 6B 

MetLeuArsPrDValGluThrProThrArgGluIleLysLysLeuAspGl wLeu 
TrpAlaPheSerLeuAspArfiGl uArfiVa 1 AlsAspLeuLvsGl uAlaVsl Gl u 
I leAsnCysThr ArgProAsnAsnAsriThrThr Ar^Ser I leHi s IleGlaPro 
GlyAraAlaPheTyrAlaThrGlyAspIlelleGlyAspIleArsGlnAl aHis 
CysAsnlleSerAraAl aLysTrpAsnAsnThrLeuLysGlnl leVslIleLys 
LeuAraAspGlnPheGluAsnLysThrl 1 el lePheAsnArsSe rSerGl yG 1 y 
AspProGluIleValMetHisSerPheAsnCysGluGlyGluPhePheTyrCys 
AsnSerThrGlnLeuFheSe rSe rThrTrp AsnGlyThrGluGlySe r AsnAsr. 
ThrGlyGlyAsnAspThrlleThrLeuProCwsArSl leLysGluI lei leAsn 
MetTrpGlnGlaVslGlvLysAleMetTyrAlaProFroIleLysGlyGlnVal 
LysCysSerSerAsnl leThrDlyLeuLeuLeuThrArsAspGl yG lyA&nSer 
LasAsnGlySerLysAsnThrGl u I 1 eArSArsGlnAlaSerArsGluLeuGlu 
PheLeuLysThrLysGlyProArsAsF Th r Pro 1 1 ePhe I leGly 
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TABLE 6C 

ATGTTACGTCCTGTAGAAACCCCAACCCGTGAAATCAAAAAACTGGACGGCCTG 
TGGGCATTCAGTCTGGATCGCGAACGCGTGGCCGATCTGAAAGAAGCTGTAGAA 
ATTAATTGTACAAGGCCCAACAACAATACAACAAGAAGTATACATATAGGACCA 
GGGAGAGCATTTTATGCAACAGGAGACATAATAGGAGATATAAGACAAGCACAT 
TGTAACATTAGTAGAGCAAAATGGAATAACACTTTAAAACAGATAGTTATAAAA 
TTAAGAGACCAATTTGAGAATAAAACAATAATCTTTAATCGATCCTCAGGAGGA 
GACCCAGAAATTGTAAT6CACAGTTTTAATTGTGGA6GGGAATTT7TCTACTGT 
AATTCAACACAACTGTTTAGTAGTACTTGGAATGGTACTGAAGGGTCAAATAAC 
ACTGGAGGAAATGACACAATCACCCTCCCATGCAGAATAAAAGAAATTATAAAC 
ATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATCAAAGGACAAGTT 
AAATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAGC 
AAGAATGGTAGCAADAATACAGAGATCCGTCGACAAGCTTCCCGGGAGCTGGAA 
TTCTT6AAGAC6AAAGGGCCTCGTGATACTCCTATTTTTATAGGT 
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TABLE 7 

LeuAsnGluSe rVelGluI 1 eAsnCasTh r A rdP roT«r AsriAtnUal ArsA rS 
SerLeuSerll eGlapToGlaArsAlaPheArflThrArsGluU ell eGl wile 
I leArgeinAl BHisCasAsnlleSerArSAlaLvsTrpAsnAsnThrLeuLas 
GlnIleV3LGluLysLeuArdGluGlnPheLwsAsriL«sThrIleUslF-heA5n 
HisSerSerGlvsGlaAspPrDGluIleValThrHisBerPheAsnCasGl aGly 
GluPhePheTa rCasAsnSerThrGlnLeuPheAsnSe rThrTrpAsnGl aThr 
AspIleLasG laAspAsnLasAsnSerThrLeuIleThrLeuProCasArsI 1 e 
LtfsGlnllelleAsnHetTrfr GlnGlaValGl aLys Al aMetTyrAlaProPro 
IleGlnGlyGlnlleArSCuBSerSerAcrilleThrGlaLeuLeuLeuThrArs! 
AspGI aGlaAsnSerSerSerArsGlu 

TABLE 7A 

CTGAATGAATCTGTAG A A ATTAATTGTACAAGACCCTAC AAC AATGT A AG AAGA 
GACTTACTT AGACATCTTT AAT7 AACATGTTCT6GGATGTTGTT ACATTCTTCT 

AGTCT ATCTATA5GACCAGGGAGAGCATTTCGTACA AG AG A A ATAATAGG A ATT 
TCAGATAGATATCCTGGTCCCTCTCGTAAAGCATGTTCTCTTTATTATCCT7AA 

ATAAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATAACACT7TAAAA 
TATTCTG7TCGTGTAAC ATTGT AATCATCTCG7TTT A2C77 ATTG7GAAATT7T 

C AGAT AGTTGAGA A ATTA AG AGAACAAT7T Amu AAT AAA ACAATAGTCTTTAAT 
GTCTATCAACTCTTTAATTCTCTTGTTAAATTCTTATTTTGTTATCAGhAATTA 

CATTCCTC AGG AGGGG ACCC AGA AATTGTAACGCACAGTTTTAATTGTGGAGGG 

gtaaggagtcctcccctgggtctttaacattgcgtgtcaaaattaacacctccc 
ga atttttct actgt aattcaacacaactgttt a atagt acttgga atggt act 

CTT AAAAAGATGACATTAAGTTGTGTTGACAAATTATCATGAACCTTACCATGA 

GACATTAAAGGAGATAATAAAAATAGCACACTCATCACACTCCCATGCAGAATA 
CT GT AATTTCCTCT ATT ATTTTT ATCGTGTGAG7AGTGT GAGGGT ACGTCTT AT 

AAACAAATTATAAACATGTGGCAGGGAGTAGGCAAAGrrAATGTATGCCCCTCCC 
TT TGTTT AAT ATTTGTAC ACCGTCCCTCATCCGTTTCGTT AC AT ACGGGGAGGG 

AT CC A AGG AC A A ATT AG ATGTTC ATCAA AT ATTACAGGGCT GCT ATT A ACAAG A 
TAGGTTCCTGTTT A ATCT ACAAGT AGTTTAT A ATGTCCCGACG AT A ATTGTTCT 

GATGGTGGTAATAGCAGCAGCAGGGAAGA 
CTACCACCATTATCGTCGTCGTCCCTTCTCTAG 
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TABLE 7B 

MetLeuArslProValGluThrProThrArflGluIleLwsLwsLeuAspGlvLeu 

TrpAlaPheSerLeuAspArSGluAraVslAlaAspLeuAsnGluSerVelGlu 

IleAsnCa*ThrftraProTtfrAsnAsnV8lApaftp<SerLeuSerIl»GlyPro 

GlyArdAlaPheAraThrAraGluIlBlleGlwIlelleAraGlnAlBHisCys 

AsnlleSerAraAlaLysTrpAsnAsr.ThrLeuLyBGlnlleVBlGluLysLeu 

Ar^GluGlnPheLysAsnLysThrlleVslPheAsnHisSerSerGlyGlyAsp 

ProGluIleVBlThrHisSerPheAsnCysGl^GlyGluPhePheTyrCysAsn 

SerThrGlriLeuPheABnSerThrTrpAsnGlHThrAtPlleLtfsGlyAspAEn 

L»sA5nSerThpLBUlleThrLeuProC«sAr*IleLwsGlnIleIleAsr.Met 

TrpGlriGlyValGltfLysAlaMetTyrAlaProProIleGlnGlyGlnlleArs 

CysSerBerAsnlleThrGlyLeuLeuLeuThrAr^AspGlyGlyAsnEerBer 

SerArSGlu61uIleAraArSGlnAlaSerArdGluLeuGluPheLeuL-5Thr 

LysDlyProAraAspThrProIlePhelleDly 
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TABLE 7C 

ATGTTACGTCCTGTABAAACCCCAACCCGTGAAATCAAAAAACTGGACGGCCT6 

TGGGCATTCAGTCTGGATCDCGAACGCGTGGCCGATCTGAATGAATCTGTAGAA 

ATTAATT6TACA AGACCCTACAACAATGTAAGA AGAAGTCT ATCTAT AGG ACC A 

GGGAGAGCATTTCGTACAAGAGAAATAATAGGAATTATAAGACAAGCACATTGT 

AACATTAGTAGAGCAAAATGGAATAACACTTTAAAACAGATAGTT6AGAAATTA 

AGAGAACAATTTAAGAATAAAACAATAGTCTTTAATCA7TCCTCAGGAGGGGAC 

CCAGAAATTGTAAC6CACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAAT 

TC AACAC A ACTGTTT AATAGTACTTGGA ATGGT ACTG ACATTA A AGG AGAT A AT 

AAAAATAGCACACTCATCACACTCCCATGCA6AAT A A AACAAATT AT AA AC ATG 

TGGCAGGGAGTAGGCAAAGCAATGTATGCCCCTCCCATCCAAGGACAAATTAGA 

TGTTCATCAAAT ATTACAGGGCTGCTATTAACA AGAGATGGTGGTAAT AGC AGC 

AGCAGG6AAGAGATCCGTCGACAAGCTTCCCGGGAGCTGGAATTCTTGAAGACG 
A AAGGGCCTCGTGAT ACTCCTATTTTTAT AGGT 
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Table 8 



5 ' CTGAACCAATCTGTAGAAATTAATTGTACAAGACCCAAC 
GACTTGGTTAGACATCTTTAATTAACATGTTCTGGGTTG 

AACAATACAAGAAAAAGTATCCGTATCCAGAGAGGACCAGGGAGAGCATTTGTT 
TTGTTATGTTCTTTTTCATAGGCATAGGTCTCTCCTGGTCCCTCTCGTAAACAA 

ACAATAGGAAAAATAGGAAATATGAGACAAGCACATTGTAACATTAGTAGAGCA 
TGTTATCCTTTTTATCCTTTATACTCTGTTCGTGTAACATTGTAATCATCTCGT 

AAATGGAATAACACTTTAAAACAGATAGATAGCAAATTAAGAGAACAATTTGGA 
TTTACCTTATTGTGAAATTTTGTCTATCTATCGTTTAATTCTCTTGTTAAACCT 

AATAATAAAACAATAATCTTTAAGCAGTCCTCAGGAGGGGACCCAGAAATTGTA 
TTATTATTTTGTTATTAGAAATTCGTCAGGAGTCCTCCCCTGGGTCTTTAACAT 

ACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTG- 
TGCGTGTCAAAATTAACACCTCCCCTTAAAAAGATGACATTAAGTTGTGTTGAC 

TTTAATAGTACTTGGTTTAATAGTACTTGGAGTACTAAAGGGTCAAATAACACT 
AAATTATCATGAACCAAATTATCATGAACCTCATGATTTCCCAGTTTATTGTGA 

GAAGGAAGTGACACAATCACCCTCCCATGCAGAATAAAACAAATTATAAA CAT G 
CTTCCTTCACTGTGTTAGTGGGAGGGTACGTCTTATTTTGTTTAATATTTGTAC 

TGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAGA 
ACCGTCCTTCATCCTTTTCGTTACATACGGGGAGGGTAGTCACCTGTTTAATCT 

TGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAGCA.-.C 
ACAAGTAGTTTATAATGTCCCGACGATAATTGTTCTCTACCACCATTATCC7TG 

AATGAGTCCGA 3 ' 

TTACTCAGGCTCTAG 
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GFGRAFY TTGQVIGRI RQAQC 
GFGRAFY TTKIJIIGTI RQAHC 
GFGRAFY TTGQVIGRI RQAQC 
GFGRAFY TMGQIIGDI RKAKC 
GFGRAFY ATGDIAGDL RQAKC 
GFGRAFY T GKIIGNI RQAHC 
GFGRAFY TTGEIIGDI RQAHC 
GFGRAFY TTGEIIGDI RQAKC 
GFGRAFY TTGEIIGn RQAHC 
GFGRAFY TTGEIIGDI RQAKC 
GFGRAFY TTGEI IGNI KQkliC 
GFGRAFY TTGEIIGDI RQAYC 
GFGRAFY TTGEIIGDI RQAHC 
GFGRAFY ATGAIIGDI RQAHC 
GFGRAFY ATGDIIGDI RQAKC 
GFGRAFY ATGDIIGDI RQAHC 
GFGRAFY ATGDIIGDI RQAHC 
GFGRATY ATGDIIGDI RQAHC 
GFGRAFY ATGDIIGDI KQAHC 
GPGRAFY TGEIKGHI RQAHC 
GFGRAFY"/ GEIRGKI KQAHC 
GPGRAFS TTRGI vGDI RQAYC 
GFGRAFH TTKRITGDl-J HQ ARC 
GPGRAFH TTKRITGD1-5 RQAKC 
GFGRAFH TTKRITGDH RGARC 
GPGRAFH TTKRITGDM RQARC 
GFGRAFH TTKRITGDH RQARC 
GPGRAFH TTKRITGDM RQARC 
GFGRAFY ATG GMGDI RQAHC 
GFGRAFY ATGGIEGDI RKARC 
GFGRAIY ATGDIIGDI RQAHC 
GFGRAVY TTGRIVGDI RQAHC 
GF GRAVY TTGRIVGDI RQAHC 
GFGRAVY TTGRIIGDI RQAHC 
GFGRAVY TTGRIIGDI RQAHC 
GFGRAVY TRTRIIGDI RQAHC 
GFGRAVY TARRIIGDI RQAHC 
GFGRAFY TTGAIKGKI RQAHC 
GPGRAFY TTGEIIGDI RQAHC 
GPGRAFY ATGDIIGNI RQAHC 
GFGRAFY' ATGEIIGDI RQAHC 
GFGRAFY ATRQIVGDI RQAHC 
GFGRAFY TTGQI IGDI RQAHC 
GPGRAFY ARGEI IGDI RRAXX 
GPGRAFY ATRRI IGDI RQAHC 
GFGRAFY ARQQIIGDI RQAHC 
GFGRAFY ARQQVIGDI RQAHC 
GFGRAFH TTGRIIGDI RKAHC 
GFGRAFH TTGRIIGDI RKAHC 
GPGRAFF ATGDIIGDI RQAKC 
GFGRAFR YRE IIGII RQAHC 
GFGRAFV TTKQIIGDI RQAYC 
GFGRAVY TTEKIIGDI RRAKC 
GFGRAVY TTEKIIGDI RRAHC 
GFGRAVY TAQRI IGDI RQAHC 
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GAGRAIY 
GAGRAIY 
GQGRAIH 
GQGRAIH 
G 3 GRAVY 
GSGRKVY 
G3GRKVY 
GLGQSliY 
GLGQALY 
GLGQALY 



T7GQIIGNI 
TTGEIIGDI 
ATGDIICDI 
TTKNIIGDI 
AKGGITG 
ATGEIIGEI 
7TGDIIGDI 
ATGDIIGDI 
TAFSIIADI 
TIGK IGKK 
TTGIVIGDI 
ATGQIIGDI 
ATGQIIGDI 
TTGEIIGDI 
TTGQVIGRI 
ATKKIIENI 
rQTGDVIGDI 
ATARI! GDI 
ATERIIGDI 
ATARIIGDI 
TTDRIIGDI 
TTDRIXGDI 
TRDRIMGDI 
TRHKIIGDI 
TRQKIIGDI 
TTR3RS II 
TTR IKGDI 
TTRGRTKII 



RGAHC 
RQAHC 
RQAYC 
RKAHC 
QAHC 
RQAHC 
RQAHC 
RQAHC 
3QAHC 
RQAHC 
RRAYC 
RKAHC 
RKAHC 
RKAYC 
RQAQC 
KKAHY 
RQAHC 
RQAHC 
RQAHC 
RQAHC 
RQAHC 
RQAHC 
RQAHC 
RQAHC 
RQAHC 
GQAHC 
RQAYC 
GQAHC 
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Table 12 

COMMON SEQUENCE PATTERNS 

SEQUENCE APPROXIMATE OCCURENCE (%\ 

I a I G P G R 60 

IalGPGR A 50 

IalGPGRAF 32 

IGPGRA 52 

GPGRAF 40 

IGPGRAF 35 

IGPGRAaY 31 
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Table 13 



IIIB 

MetThrArglleGlnArgGlyProGlyArgAIaPheVal GlyGlyGIyGly - 

RF 

SerlleThrArgGlyProGlyArgVallleTyr GlyGlyGIyGly - 

MN 

ArglleHisIleGlyProGlyArgAIaPheTyr GlyGlyGIyGly - 

SerlleHisIIeGlyProGlyArgAlaPheTyr GlyGlyGIyGly - 
WMJ1 

HisIleHisIIeGIyProGlyArgAIaPheTyr GlyGlyGIyGly - 



LeuSerlleCys 
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Table 13A 



E^oliBG 

MetLeuArgProValGluThrProThrArgGluIleLysLysLeuAspGlyLcuTrp - 



AlaPheSerLeuAspATgGluArgValValArgTyrHisArgTrpIleArgGlnAlaSer - 



IIIB 

MetThrArglleGlnArgGIyProGlyArgAIaPheVal GlyGlyGlyGly - 

RF 

SerlleThrArgGlyProGIyArgVallleTyr GlyGlyGlyGly - 

MN 

ArglleHisIleGIyProGlyArgAlaPheTyr GlyGlyGlyGly - 

SC 

SerlleHisIleGlyProGlyArgAlaPheTyr GlyGlyGlyGly - 

WMJ1 

HisIleHisIleGlyProGlyArgAlaPheTyr GlyGlyGlyGly - 



LcuSerlleCys 
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Claims 

1 1. A compound having the capability of eliciting, and/or binding with, neutralizing antibodies where 

2 said capability results from an amino acid sequence which: 

3 (a) is the principal neutralizing domain of an HTV variant; 

4 (b) is a portion of the principal neutralizing domain of an HTV variant; or 

5 (c) is equivalent either to a principal neutralizing domain or a portion thereof. 

1 2, A compound, other than naturally occurring HTV envelope protein, said compound having the 

2 capability of eliciting, and/or binding with, neutralizing antibodies, said compound comprising the 

3 principal neutralizing domain, or a segment thereof; of an HTV variant. 

1 3. The compound, according to claim 2, wherein said compound is modified by addition of one or 

2 more of the following moieties: cysteine, a protein or other moiety capable of enhancing 

3 immunogenic! ty, a peptide from an HTV principal neutralizing domain, a peptide capable of stimulating 

4 T-cells, or a general immune stimulant 

1 4. A compound having the capability of eliciting, and/or binding with, neutralizing antibodies, said 

2 compound consisting essentially of the amino adds between cysteine residues occurring at, or around, 

3 positions 296 and 331 of the envelope protein of an HIV variant 

1 5. A compound having the capability of eliciting, and/or binding with, HIV neutralizing antibodies, 

2 said compound having the formula 

3 axOzGyb 

4 wherein x is 0 to 13 amino acids in length; 

5 y is 0 to 17 amino acids in length; and 

6 z is P, A, S, Q, or L; and 

7 either n or b, but not both, may be omitted; cither a or b individually may comprise any one of the 

8 following: cysteine, a protein or other moiety capable of enhancing immunogenic) ty, a peptide from 

9 an HIV principal neutralizing domain, a peptide capable of stimulating T-cells, or a general immune 
10 stimulant 

1 6. The compound, according to claim 5, wherein said compound is circularized. 

1 7. The compound, according to claim 5, wherein said compound comprises epitopes from the 

2 principal neutralizing domain of more than one HIV variant 
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1 


8. A polypeptide selected from the group consisting of 


2 


(1) 


HIV 10 Kd fusion protein denoted Sub 1; 


3 


(2) 


HIV protein portion of Sub 1; 


4 


(3) 


HTV 18 Kd fusion protein denoted Sub 2; 


5 


(4) 


HIV protein portion of Sub 2; 


6 


(5) 


HTV 27 Kd fusion protein denoted PBIrp; 


7 


(6) 


HTV protein portion of PBIrp; 


8 


(7) 


HIV 28 Kd fusion protein denoted PBl^; 


9 


(8) 


HIV protein portion of PBImn; 


10 


(9) 


HIV 26 Kd fusion protein denoted PB1 SC ; 


11 


(10) 


HIV protein portion of PB1 SC ; 


12 


(") 


HIV 26 Kd fusion protein denoted PBIwmj^ 


13 


(12) 


HIV protein portion of PBlwxoi 


14 


(13) 


peptide m B (BH10)-PND; 


15 


(14) 


peptide RF-PND; 


16 


(15) 


peptide MN-PND; 


17 


(16) 


peptide SC-PND; 


18 


(17) 


peptide WMJ-2-PND; 


19 


(18) 


peptide LAV-MAL-PND; 


20 


(19) 


peptide SF-2-PND; 


21 


(20) 


peptide NY5-PND; 


22 


(21) 


peptide Z3-PND; 


23 


(22) 


peptide WMJ1-PND; 


24 


(23) 


peptide WMJ3-PND; 


25 


(24) 


peptide Z6-PND; 


26 


(25) 


peptide LAVELI-PND; 


27 


(26) 


peptide CDC451-PND; 


28 


(27) 


peptide CDC42-PND; 


29 


(28) 


peptide BAL-PND; 


30 


(29) 


peptide HTV-2-PND; 


31 


(30) 


peptide 135; 


32 


PD 


peptide 136; 


33 


(32) 


peptide 139; 


34 


(33) 


peptide 141; 


35 


(34) 


peptide 142; 


36 


(35) 


peptide 143; 


37 


(36) 


peptide 131; 
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38 


(37) 


peptide 132; 


39 


(38) 


peptide 134; 


40 


(39) 


peptide 339, 


41 


(40) 


RP342 (WMJ2); 


42 


(41) 


RP343 (SQ; 


43 


(42) 


RP60 (m B ); 


44 


(43) 


RP335 (m B ); 


45 


(44) 


RP337 (IH B ); 


46 


(45) 


RP77(m B ); 


47 


(46) 


RP83 (WMJ1); 


43 


(47) 


RP79 (m B ); 


49 


(48) 


RP57; 


50 


(49) 


RP55; 


51 


(50) 


RP75; 


52 


(51) 


RP56; 


53 


(52) 


RP59; 


54 


(53) 


RP73 (m Bl RF); 


55 


(54) 


RP74 (TH n ,RF,MN3Q; 


56 


(55) 


RP80 OllnJU 7 ); 


57 


(56) 


RP81 (IH^RF.WMJl^fN); 


58 


(57) 


RP82 fWMJl,MN); 


59 


(58) 


RP137 OIIaJlF); 


60 


(59) 


RP140 nn n ,RF): 


61 


(60) 


peptide 64 (HTV-IIIB/HIV-RP/HIV-MN/HrV-SC); 


62 


(61) 


peptide 338 (HIV-IIIb/HIV-RF); 


63 


(62) 


peptide 138; 


64 


(63) 


RP342; 


65 


(64) 


RP96; 


66 


(65) 


RP97; 


67 


(66) 


RP98; 


68 


(67) 


RP99; 


69 


(68) 


RP100; 


70 


(69) 


RP102; 


71 


(70) 


RP88; 


72 


(71) 


RP91; 


73 


(72) 


RP104; 


74 


(73) 


RP106; 
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75 


(74) 


RP108; 


76 


(75) 


RP70; 


77 


(76) 


RP84; 


78 


(77) 


RP144; 


79 


(78) 


RP145; 


80 


(79) 


RP146; 


81 


(80) 


RP147; 


82 


(81) 


RP150; 


83 


(82) 


RP151; 


84 


(83) 


RP63; 


85 


(84) 


RP41; 


86 


(85) 


RP61; 


87 


(86) 


RP75; 


88 


(87) 


RP111; 


89 


(88) 


RP113; 


90 


(89) 


RP114; 


91 


(90) 


RP116; 


92 


(91) 


RP120; 


93 


(92) 


RP121c; 


94 


(93) 


RP122c; 


95 


(94) 


RF123c; 


96 


and chemical modi 



1 

2 
3 
4 



9. A DNA sequence which codes for a polypeptide, other than the naturally occurring HIV 
envelope protein, said polypeptide having the capability of eliciting, and/or binding with, neutralizing 
antibodies, said polypeptide comprising the principal neutralizing domain, or a segment thereof, of an 
HIV variant 



1 
2 
3 
4 
5 
6 
7 
8 
9 



10. The DNA sequence, according to claim 9, wherein said polypeptide has the formula 

a x O i G y b 
wherein x is 0 to 13 amino acids in length; 

y is 0 to 17 amino acids in length; and 
* is P, A, S, Q, or L; and 
either . or b, but not both, may be omitted; either a or b individually may comprise any one of the 
following: cysteine, a protein or other moiety capable of enhancing immunogenic^, a peptide from 
an HIV principal neutralizing domain, a peptide capable of stimulating T-cellt, or a general immune 
stimulant 
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1 11. A composition capable of eliciting and/or binding with neutralizing antibodies to a broad range 

2 of HIV variants, said composition comprising one or more compounds having the following formula: 

3 axGzGyb 

4 wherein x is 0 to 13 amino acids in length; 

5 y is 0 to 17 amino acids in length; and 

6 z is P, A, S, Q, or L; and 

7 either a or b, but not both, may be omitted; either a or b individually may comprise any one of the 

8 following: cysteine, a protein or other moiety capable of enhancing immunogenicity, a peptide from 

9 an HIV principal neutralizing domain, a peptide capable of stimulating T-cells, or a general immune 
10 stimulant 

1 12. A composition, according to claim 11, wherein x is selected from: 

2 x(0) s x is not present 

3 x(l) = Xl 

4 x(2) s x 2 xj 

5 x(3) b x 3 x 2 X! 

6 x(4) 0x4X3X2 xj 

7 x(5) 0x5X4X3 x 2 X! 

8 x(6) 3^X5X4X3X2X1 

9 x(7) a x 7 x* x 5 X4 x 3 x 2 x t 

10 x(8) B Xg x 7 x* x 5 X4 x 3 x 2 x 2 

11 X(9) B X9 Xb X 7 Xfi X S X4 X 3 X 2 X : 

12 X(10) 9 X 10 X9 Xg X 7 X6 15 X4 x 3 x 2 x 2 

13 x(ll) m x n x 10 X9 Xb x 7 Xfi x s X4 x 3 x 2 x a 

14 x(12) m x 12 x n x 10 X9 xg x 7 X6 x 5 x, x 3 x 2 X! 

15 x(13) m x 13 x 12 x n X10 19 *fi ^7 X6 x 5 X4 X3 x 2 x x 

16 1 is P, L, A, S, or Q; and y is selected from: 

17 y(0) m y is not present 

18 y(l) e yi 

19 y(2) m y, y 2 

20 y(3) - yi y 2 y 3 

21 y(4) a yi y 2 y 3 y 4 

22 y(5) = yi y 2 y 3 y 4 y 5 

23 y(6) s yi y 2 y 3 y 4 y 5 y 6 

24 y(7) B yi y2 y 3 y 4 y f y 6 y ? 

25 y(8) K yi y 2 y3 y< y* y* y? ye 
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26 y( 9 ) ■ yi yi y* y4 ys y« y7 ys y9 

27 y(10) a yi y 2 y 3 y 4 y 5 y 6 y 7 y 8 y 9 y 10 

28 y(ll) a yi y 2 y 3 y 4 y 5 y 6 y 7 y 8 y 9 y 10 y n 

29 y(12) a yi y 2 y 3 y 4 y 5 y 6 y 7 y 8 y 9 y 10 y n y n 

30 y(13) a yi y 2 y 3 y 4 y 5 y fi y 7 y 8 y 9 y 10 y n y 12 y 13 

31 y(H) a yi y 2 y 3 y 4 y 5 y 6 y ? y g y 9 y 10 y n y 12 y 13 y u 

32 y(15) = y x y 2 y 3 y 4 y 5 y 6 y 7 y 8 y 9 y 10 y n y 12 y u y l4 y 15 

33 y(16) a yi y 2 y 3 y 4 y 5 y 6 y 7 y g y 9 y 10 y n y 12 y u y u y 15 y 16 

34 y(") a yi y 2 y 3 y 4 y 5 y 6 y, y 8 y 9 y 10 y n y 12 y 13 y u Yl3 y 16 y 17 

35 and wherein 

36 *i is I, R, M, IQR, V, L, K, F, S, G, Y, SRQ, or YQR; 

37 x 2 is H, R, Y, T, S, P, F, N, A, K, G, or V; 

38 x 3 is I, L, M, T, V, E, O, F, or Y; 

39 x 4 is R, S, G t H, A, K, or not present; 

40 x 5 is K, R, I, N, Q, A, IR, RQ, or not present; 

41 «6 is R, K, S, I, P, Q, E, G, or T, 

42 x 7 is T, K, V, I, A, R, P, or E; 

43 Xg is N, NV, Y f KI, I, T, DK, H, or K; 

44 X9 is N, S, K, E, Y t D, I, or Q; 

45 x 10 is N, Y, S, D ( G, or H; 

46 x„ is P; 

47 x 12 is R, I, or K; 

48 x 13 is T, I, M or A; 

49 yi is R, K» Q, G, S, or T; 

50 y 2 is A, V, N, R, K, T, S, F, P, or W; 

51 y 3 is F, I, V, L, W, Y, G, S, or T, 

52 y 4 is Y, V, H, L, F, S, I, T, M, R, VH, or FT; 

53 ys is T, A, V, Q, H, I, S, Y, or not present; 

54 y6 is T, R, I, Q, A, M, or not present; 

55 y? is G, E, K, R, T, D, Q, A, H, N, P, or not present; 

56 y 8 is R, Q, E, K, D, N, A, G, S, I, or not present; 

57 y9 is I, V, R, N, G, or not preseni; 

58 yio is I, T, V, K, M, R, L, S, E, Q, A, or not present; 

59 yn is G, R, E, K, H, or not present; 

60 y 12 is D, N, I, R, T, S, or not present; 

61 yi3 is I, M, ME, U or not present; 

62 yu is R, G, K, S, E, or not present; 
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63 yij is Q, K, or R; 

64 y 16 is A; and 

65 y„ is H, Y, R, or Q. 



1 13. The composition, according to claim 12, wherein 

2 x, is I; 

3 i is P; 

4 yi is R; and 

5 y 2 is A. 



1 14. The composition, according to claim 12, wherein 

2 Xl is I; 

3 x 3 is I; 

4 z is P; and 

5 yi is R. 

1 15. The composition, according to claim 12, wherein 

2 zisP; 

3 yjisR; 

4 y 2 is A; and 

5 y 3 is R 

1 16. A composition, according to claim 12, wherein 

2 xj is I; 

3 x 2 is H; 

4 x 3 is I; 

5 *4 is R; 

6 x 5 isK; 

7 XfiisR; 

8 x 7 is T; 

9 z is P; 

10 yi is R; 

11 y 2 is A; 

12 y 3 b F; 

13 y 4 is Y; 

14 y 5 is T; 

15 y 6 is T; and 
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16 y 7 is G. 

1 17. The composition, according to claim 11, wherein said compound is circularized. 

1 18. The composition, according to claim 11, wherein a and/or b comprise a peptide from an HIV 

2 principal neutralizing domain. 

1 19. The composition, according to claim 11, wherein said moiety capable of enhancing 

2 immunogenicity is a viral panicle, microorganism, or immunogenic portion thereof 

1 20. A prophylactic or therapeutic composition comprising immune globulin, monoclonal antibodies, 

2 and/or polyclonal antibodies generated by immunizing an appropriate animal such as a mouse, rat, 

3 horse, goat, human, or chimpanzee with a hybrid compound which comprises compounds where said 

4 polypeptides are not naturally occurring HIV envelope proteins, said polypeptides having the capability 

5 of eliciting, and/or binding with, neutralizing antibodies, said polypeptides comprising the principal 

6 neutralizing domain, or a segment thereof, of an HIV variant. 

1 21. A prophylactic or therapeutic composition comprising antibodies generated by immunizing an 

2 animal or human with at least one compound comprising a principal neutralizing domain of an HIV 

3 variant followed by immunization of said animal or human with 

4 (a) a mixture comprising compounds, where said compounds are not naturally occurring HIV 

5 envelope proteins, said compounds having the capability of eliciting, and/or binding with, 

6 neutralizing antibodies, said compounds comprising the principal neutralizing domain, or 

7 a segment thereof, of an HTV variant; or 

8 (b) a hybrid compound which comprises polypeptides, where said polypeptides are not naturally 

9 occurring HTV envelope proteins, said polypeptides having the capability of eliciting, and/or 

10 binding with, neutralizing antibodies, said polypeptides comprising the principal neutralizing 

11 domain, or a segment thereof, of an HTV variant 

1 22. A prophylactic or therapeutic composition, comprising an antibody raised against a mature 

2 comprising compounds, where said compounds are not naturally occurring HTV envelope proteins, but 

3 have the capability of eliciting, and/or binding with, neutralizing antibodies, each of said compounds 

4 comprising the principal neutralizing domain, or a segment thereof, of an HIV variant. 
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1 23. A prophylactic or therapeutic composition, comprising an antibody raised against at least one 

2 compound having the capability of eliciting, and/or binding with, HIV neutralizing antibodies, said 

3 compound having the formula 

4 ixOiOyb 

5 wherein x is 0 to 13 amino adds in length; 

6 y is 0 to 17 amino acids in length; and 

7 i is P, A, S, Q, or L; and 

8 either a or b, but not both, may be omitted; either a or b individually may comprise any one of the 

9 following: cysteine, a protein or other moiety capable of enhancing immunogenidty, a peptide from 

10 an HIV principal neutralizing domain, a peptide capable of stimulating T-cells, or a general immune 

11 stimulant 

1 24. The composition, according to claim 23, wherein a and/or b comprises a peptide from an HIV 

2 principal neutralizing domain. 

1 25. An antibody raised against a compound, said compound having the formula 

2 azGzGyb 

3 wherein x is 0 to 13 amino acids in length; 

4 y is 0 to 17 amino adds in length; and 

5 z is P, A, S, Q, or L; and 

6 either a or b, but not both, may be omitted; either a or b individually may comprise any one of the 

7 following: cysteine, a protein or other moiety capable of enhancing immunogenidty, a peptide from 

8 an HTV principal neutralizing domain, a peptide capable of stimulating T-cells, or a general immune 

9 stimulant. 

1 26. The antibody, according to claim 25, wherein x is selected from 

2 x(0) m x is not present 

3 x(l) E Xl 

4 x(2) - x 2 xj 

5 xC3) * x 3 x 2 Xi 

6 x(4) h X4 x 3 x 2 X! 

7 x(5) HX5X4X3 x 2 x, 

S x(6) a leg x 5 x< x 3 x 2 Xi 

9 x(7) bx 1 x 6 x 5 x a x 3 x 2 x : 

10 x(8) "6X8X7X6X5X4X3X2 Xj 

U X(9) B X9 Xg X 7 X« X 5 X4 x 3 X 2 X! 

12 x(10) m x 10 X9 xa x 7 x<i x 5 X4 x 3 x 2 Xj 
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y(0) 




y is not present 




y(i) 


■ 


yi 




y(2) 


B 


yiy2 




y(3) 


■ 


yi yz y3 




y(4) 


B 


yi y2 y3 y4 




y(5) 


B 


yi yi y3 y4 ys 




y(6) 




yi y2 y3 y4 ys ye 




y(7) 


B 


yi yi y3 y4 ys ye 


yi 


y(8) 


B 


yi yz Y3 ys ye 


y?ys 


y(9) 




yi Y2 Y3 ya ys y& 


y? y& yg 
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13 x(ll) m x u x 10 *> xe x 7 x* x 5 ^ x 3 x 2 x : 

14 x(12) e x 12 x n X 10 X9 Xg x 7 xe x 5 X4 x 3 x 2 x x 

15 x(13) ■ x u x 12 x n xjo 19X8X7X6X5X4X3X2 xj 

16 x is P, L, A, S, or Q; and y is selected from: 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 

27 y(10) b yj y 2 y 3 y 4 y 5 y 6 y 7 y g y g y lQ 

28 y(H) e y, y 2 y 3 y 4 y 5 y 6 y ? y B y 9 y, 0 y u 

29 y(12) ■ yj y 2 y 3 y 4 y 5 y 6 y 7 y 8 y 9 y 10 y n y 12 

30 y(i3) ■ yi y 2 y 3 y 4 y 5 y 6 y? ya ys yio yn yn yn 

31 y(14) e yi y 2 y 3 y 4 y 5 y 6 y? y g y ? y 1() y n y 12 yi3 y , 4 

32 y(15) S y, y 2 y 3 y 4 y 5 y 6 y ? y g y g y w y u y u y n y u y ^ 

33 y(16) B yj y 2 y 3 y 4 y £ y fi y ? y 8 y 9 y 1Q y u y^ y ^ , M yjfi 

34 y(") « yi y2 y 3 y< y 5 y« y? y 8 y 9 y« yn yn y u yu yu yn 

35 and wherein 

36 xj is 1, R, M, IQR, V, L, K, F, S, G, Y, SRG, or YQR; 

37 x 2 is H, R, Y, T, S, P, F, N, A, K, G, or V; 

38 x 3 is I, U M, T, V, E, G, F, or Y; 

39 X4 is R, S, G, H, A, K, or not present; 

40 X5 is K, R, I, N, Q, A, IR, RQ, or not present; 

41 X6 is R, K, S, I, P, Q, E, G, or T, 

42 x 7 is T, K, V, I, A, R, P, or E; 

43 xg is N, NV, Y, KI, I T, DK, H, or K; 

44 is N, S, K, E, Y, D, I, or Q; 

45 x ]0 is N, Y, S, D, G, or H; 

46 x n is P; 

47 x 12 is R, I, or K; 
x 13 is T, I, M or A; 



48 



49 yi is R, K, Q, G, S ? or T; 
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50 y 2 is A, V, N, R, K, T, S, F, P, or W; 

51 y 3 is F, I, V, L, W, Y, G, S, or T, 

52 y 4 is Y, V, H, U F. S, I, T, M ( R, VH, or FT, 

53 y 5 is T, A, V, Q, H, I, S, Y, or not present; 

54 y 6 is T, R, I, Q, A, M, or not present; 

55 y 7 is G, E, K, R, T, D, Q, A, H, N, P, or not present; 

56 y 8 is R, Q, E, K, D, N, A, G, S, I, or not present; 

57 y 9 is I, V, R, N, G, or not present; 

58 y 10 is I, T, V, K, M, R, L, S, E, Q, A, or not present; 

59 y n is G, R, E, K, H, or not present; 

60 y J2 is D, N, I, R, T, S, or not present; 

61 y 13 is I, M, ME, L, or not present; 

62 y 14 is R, G, K, S, E, or not present; 

63 y 15 is Q, K, or R; 

64 y 16 is A; and 

65 y 17 is H, Y, R, or Q. 

1 27. The antibody, according to claim 26, wherein said compound(s) are selected from the group 

2 consisting of 

3 (a) a-Y-S-N-V-R-N-R-I-H-I-G-P-G-R-A-F-H-T-T-K-R-I-T-b; 

4 (b) a-N-N-N-T-S-R-G-I-R-I-G-P-G-R-A-I-L-A-T-E-R-M-b; 

5 (c) a-N-N-N-T-R-K-G-I-F-I-G-P-G-R-N-I-Y-T-T-G-N-I-l-b; 

6 (d) a-N-T-R-K-S-I-R-l-Q-R-G-P-G-R-A-F-V-T-I-G-K-I-G-b; 

7 (c) a-N-N-N-T-R-K-R-(I or V)-T-M-G-P-G-R-V-(Y or W)-Y-(X or T>(A or T)-G-Q-M-b; 

8 (f) a -N-N-N-(I or 1>R-K-(R or S)-I-T-(R or K)-G-P-G-(R or K)-V-I-Y-A-T-G-Q-M-b; 

9 (£) a-N-N-N-T-R-K-G-I-Y-V-G-S-G-R-(A or K)-V-T-T-R-(D or H or Q)-K-I-(l or M)-b; and 
10 (h) a-T-R-Q-(R or S)-T-P-I-G-L-G-Q-(A or S)-L-Y-T-T-R-b. 

1 28. A vaccine composition for generating a broadly neutralizing immunological respon.se, said 

2 vaccine composition comprising a mixture comprising compounds, where said compounds are not 

3 naturally occurring HTV envelope proteins, but have the capability of eliciting, and/or binding with, 

4 neutralizing antibodies, said compounds comprising the principal neutralizing domain, or a segment 

5 thereof, of HTV variants. 

1 29. A vaccine composition, for generating a broadly neutralizing immunological response, said 

2 vaccine composition comprising a hybrid compound which comprises polypeptides, where said 

3 polypeptides are not naturally occurring HIV envelope proteins, but have the capability of eliciting, 
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4 and/or binding with, neutralizing antibodies, said polypeptides comprising the principal neutralizing 

5 domain, or a segment thereof, of as HIV variant 

1 30. A vaccine composition comprising at least one compound having the capability of eliciting, 

2 and/or binding with, HIV neutralizing antibodies, said compound having the formula 

3 axGiGyb 

4 wherein x is 0 to 13 amino adds in length; 

5 y is 0 to 17 amino adds in length; and 

6 z is P, A, S, Q, or L; and 

7 either a or b, but not both* may be omitted; either a or b individually may comprise any one of the 

8 following: cysteine, a protein or other moiety capable of enhancing immunogenidty, a peptide from 

9 an HIV principal neutralizing domain, a peptide capable of stimulating T-cells, or a general immune 
10 stimulant 

1 31. The vaccine composition, according to claim 30, wherein said composition is formulated with 

2 an immunological adjuvant 

1 32. The vaccine, according to claim 30, wherein x is selected from: 

2 x(0) ■ x is not present 

3 x(l) - x, 

4 x(2) m x 2 x l 

5 x(3) m x 3 x 2 

6 x(4) B X4 x 3 X 2 X! 

7 x(5) ■ x 5 Xt x 3 x 2 Xj 

8 x(6) ■ Xfi x 5 X4 x 3 x 2 Xj 

9 X(7) ■ X 7 X* X 5 X4 X 3 X 2 Xi 

10 x(8) ■ Xg X 7 X* X S X4 X 3 X 2 Xi 

11 x(9) 81X9X8X716X5X4X3 X 2 X! 

12 x(l0) a X 10 X, Xg X 7 X6 X 5 X4 X 3 X 2 Xj 

13 « *11 *10 X9 Xg X 7 X6 X 5 X4 X 3 X 2 X, 

14 x(12) m x 12 x n x 10 X, X 6 X 7 X* X 5 X4 x 3 X 2 X! 

15 x(13) m x n x 12 x n x 10 19X8X7X6X5X4X3X3 x l 

16 2 is P, L, A, S, or Q; and y is selected from: 

17 y(0) b y is not present 

18 y(l) B y, 

19 y(2) « y, y 2 

20 y(3) - Vi y 3 
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21 y(4) m yi y 2 y 3 y 4 

22 y(5) e yj y 2 y 3 y A y 5 

23 y(6) s yj y 2 y 3 y 4 y 5 y 6 

24 y(7) = y x y 2 y 3 y 4 y 5 y 6 y 7 

25 y(8) b yi y 2 y 3 y 4 y 3 y 6 y 7 y 8 

26 y(9) « yi y 2 y 3 y 4 y 5 y 6 y 7 y 8 y 9 

27 y(10) e yj y 2 y 3 y 4 y 5 y 6 y? ys V9 Yw 

28 y(ll) ■ y x y 2 y 3 y 4 y 5 y 6 fj y 8 y 9 y 10 yn 

29 y(12) b y a yj y 3 y 4 y 5 y 6 y 7 y s y 9 y 10 y n y 22 

30 y(13) e yi y 2 y 3 y 4 y 5 y 6 y 7 y 8 y 9 y 10 yn yn Yis 

31 y(14) ■ yi y 2 y 3 y 4 y 5 y 6 y 7 y 8 y 9 y w yn y u >i3 Yi4 

32 y(15) a yi y 2 y 3 y 4 y 5 y 6 y 7 y 8 y 9 y 10 y n y 12 y 13 y 14 y 13 

33 y(16) b yi y 2 y 3 y 4 y 5 y 6 y 7 y 8 y p y 10 y n y 12 y 13 y 14 y 15 y 16 

34 y(17) e yi y 2 y 3 y 4 y 5 y 6 y, y 8 y 9 y 10 y u y n y 13 y 14 yi5 y 16 y 17 

35 and wherein 

36 xj is I, R, M, IQR, V f L, K, F, S, G, Y, SRG, or YQR; 

37 x 2 is H, R, Y, T, S, P, F, N, A, K, G, or V; 

38 x 3 is I, L, M, T, V, E, G, F, or Y; 

39 X4 is R, S, G, H, A, K, or not present; 

40 x 5 is K, R, I, N, Q, A, IR, RQ, or not present; 

41 Xfi is R, K, S t I, P, Q, E, G, or T; 

42 x 7 is T, K, V, I, A, R, P, or E; 

43 x 8 is N, NV t Y, KI, I, T, DK, H, or K; 

44 xj, is N, S, K, E, Y, D, I, or Q; 

45 Xl0 is N, Y, S, D, G, or H; 

46 x u is P; 

47 x 12 is R, I, or K; 

48 x u is T, I, M or A; 

49 y, is R, K, Q, G, S, or T; 

50 y 2 is A, V, N, R, K, T, S, F, P, or W; 

51 y 3 is F, 1, V, U W, Y, G, S, or T; 

52 y 4 is Y, V, H, L, F, S, I, T, M, R, VH, or FT; 

53 ys is T, A, V, Q, H, I, S, Y, or not present; 

54 y 6 is T, R, I, Q, A, M, or not present; 

55 y 7 is G, E, K, R, T, D, Q, A, H, N, P, or not present; 

56 yg is R, Q, E, K, D, N, A, G, S, 1, or not present; 

57 y 9 is I, V, R, N, G, or not present; 
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58 yj 0 is I, T t V, K, M, R, U S, E, Q, A, or not present; 

59 y n is G, R, E, K, H, or not present; 

60 y n is D, N, I, R, T, S, or not present; 

61 y 13 is I, M, ME, L, or not present; 

62 y 14 is R, G, K, S, E, or not present; 

63 y 15 is Q, K, or R; 

64 y 16 is A; and 

65 y„ is H, Y. R, or Q. 

1 33. The vaccine, according to rUim 32, wherein 

2 xj is I; 

3 x 2 is H; 

4 X3 is I; 

5 X4*sR; 

6 x 5 isK; 

7 ^isR; 

8 x 7 is Tt 

9 xisP; 

10 y, is R; 

11 y 2 is A; 

12 y3 is F; 

13 y 4 is Y; 

14 y 5 is T; 

15 y 6 is T; and 

16 y 7 is G. 

1 34. The vaccine, according to claim 32, wherein said compound (s) are selected from the group 

2 consisting of 

3 (a) a-Y-S-N-V-R-N-R-I-H-I-G-P-G-R-A-F-H-T-T-K-R-l-T-h; 

4 (b) a-N-N-N-T-S-R-G-I-R-I-G-P-G-R-A-I-UA-T-E-R-I-I-b; 

5 (c) a-N-N-N-T-R-K-G-I-F-I-G-P43-R-N-I.Y-T-T-0-N-I.l-b; 

6 (d) a-N-T-R-K-S-I-R-I-Q-R-G-P-G-R-A-F-V-T-I-G-K-I-G-b; 

7 (e) a-N-N-N-T-R-K-R-(I or V>T-M-G-P-G-R-V-(Y or W)-Y-(X or 1>(A or T>-G-Q-M-b; 

8 (f) a-N-N-N-(I or T>R-K-(R or S)-I-T-(R or K)-G-P-G-(R or K>-V-i-Y-A-T-G-0-M-b; 

9 (£) «-N-N-N-T-R-K-G-I-Y.V-G-S-G-R-(A or K)-V-T-T-R-(D or H or Q)-K-!-(l or M>-b : and 
10 0») a-T-R-Q-(R or S>-T-P-I-G-L-0-Q-(A or S>-L.Y-T-T-R-b. 
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1 35. The vaccine composition of claim 32 wherein said compound(s) are capable of eliciting 

2 antibodies that bind to the sequence G-P-G-R-A-F. 

1 36. The vaccine composition of claim 32 wherein said compoundfs) are capable of eliciting 

2 antibodies that bind to the sequence I-G-P-G-R-A-F. 

1 37. The vaccine composition of claim 32 wherein said com pound (s) are capable of eliciting 

2 antibodies that bind to the sequence I-G-P-G-R-A. 

1 38. The vaccine composition of claim 32 wherein said compound(s) are capable of eliciting 

2 antibodies that bind to the sequence 1-a-I-G-P-G-R, wherein a is any of the 20 amino acids. 

1 39. The vaccine, according to claim 38, wherein a is H. 

1 40. The vaccine composition of claim 32 wherein said com pound (s) are capable of eliciting 

2 antibodies that bind to the sequence I-a-I-G-P-G-R- A, wherein a is any of the 20 amino acids. 

1 41. The vaccine, according to claim 40, wherein a is R 

1 42. The vaccine composition of claim 32 wherein said compound(s) are capable of eliciting 

2 antibodies that bind to the sequence I-a-I-G-P-G-R- A-F, wherein a is any of the 20 amino acids. 

1 43. The vaccine, according to claim 42, wherein a is PL 

1 44. The vaccine composition, according to claim 30, wherein said compound(s) are circularized. 

1 45. The vaccine composition, according to claim 30, wherein said compound (s) comprise epitopes 

2 from more than one HTV variant 

1 46. The vaccine, according to claim 32, wherein 

2 xj is I; 

3 zisP; 

4 y, is R; and 

5 y 2 is A. 

1 47. The composition, according to claim 32, wherein 

2 Xl is I; 
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3 x 3 is I; 

4 i is P; and 

5 y 1 is R. 

1 48. The composition, according to claim 32, wherein 

2 z is P; 

3 yi isR; 

4 y 2 is A; and 

5 y 3 is R 

1 49. The vaccine, according to claim 32, wherein x is selected from the group consisting of x(5), x(6), 

2 x(7), x(8), x(9), x(10), and x(ll); and y is selected from y(5), y(6), y(7), y(8), y(9), y(10), and y(ll). 

1 50. The composition, according to claim 32, wherein said compound has the following amino acid 

2 sequence: 

1 51. A kit for use in detecting antibody against HIV in a biological fluid, said kit comprising: 

2 (a) a compound, where said compound is not a naturally occurring HIV envelope protein, said 

3 compound having the capability of eliciting, and/or binding with, neutralizing antibodies, said 

4 compound comprising the principal neutralizing domain, or a segment thereof, of an HIV 

5 variant; and 

6 (b) a means for detecting complexes formed between said antibody and said compound. 

1 52. The Id t, according to claim 51, wherein said com poo nd has the formula 

2 xGxOy 

3 wherein x is 0 to 13 amino acids in length; 

4 y is 0 to 17 amino adds in length; and 

5 zisP, A,S,Q, orL 

1 53. The kit, according to clafm 52, wherein x is selected from 

2 x(0) m x is not present 

3 x(l) = Xl 

4 *(2) e, x 2 Xl 

5 x(3) * x 3 x 2 x, 

6 x(4) e^i 3 x 2 x l 

7 *( 5 ) ■ X 5 X,j X 3 X 2 Xj 
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8 x(6) 8X6X5X4X3X2 xj 

9 x(7) - x 7 X6 x s X4 x 3 x 2 x x 

10 x(8) = Xg X 7 X6 X5 X4 x 3 x 2 Xi 

11 X(9) H Xy X 8 X 7 Xfi X 5 X4 X 3 X 2 Xj 

12 X(10) E X 10 Xg X 7 X6 X5 X4 X3 X 2 Xj 

13 x(ll) E X U X 10 319X8X7X6X5X4 x 3 x 2 Xj 

14 X(12) S X 12 X„ X 10 X9 X 8 X 7 Xfi Xj X4 x 3 x 2 X! 

15 X(13) E X 13 X 12 X n X 10 Xs Xe X 7 Xfi x 5 X4 X 3 X 2 Xj 

16 z is P, L, A, S, or Q; and y is selected from: 

17 y(0) s y is not present 

18 y(1) m Yl 

19 y(2) a y x y 2 

20 y(3) s y t y 2 y 3 

21 y(4) e yj y 2 y 3 y 4 

22 y(5) = yi y 2 y3 y4 ys 

23 y(6) e yi y 2 y 3 y 4 y 5 y 6 

24 y(7) s y t y 2 y 3 y 4 y 5 y 6 y 7 

25 y(») B yi yi ys y4 ys y* y? ys 

26 y(9) b yi y 2 y 3 y 4 y 5 y 6 y 7 y 8 y 9 

27 y(10) b yi y 2 y 3 y 4 y 5 y 6 y 7 y 8 y 9 y 10 

28 y(H) K yi y2 y3 y< ys y& y? ye y» yio yn 

29 y(12) ■ yi y 2 y 3 y 4 y s y 6 y 7 y 8 y 9 y 10 y n y 12 

30 y(13) E yi y 2 y 3 y 4 y 5 y 6 y 7 y B y 9 y 10 y n y J2 y B 

31 y(14) m yi y 2 y 3 y 4 y 5 y 6 y 7 y g y 9 y 1Q y u y n y a y u 

32 y(15) e yj y 2 y 3 y 4 y 5 y 6 y 7 y 8 y 9 y 10 y n y n yi3 y u yu 

33 y(16) e yi y 2 y 3 y 4 y 5 y 6 y 7 y 8 y 9 y 10 y n y n y n y 14 yi5 y lfi 

34 y(17) e yi y2 y 3 y 4 y 5 y 6 y 7 y g y 9 y 10 y n y 12 y 13 y „ yiJ y lfi y J7 

35 and wherein 

36 x 3 is I, R, M, IQR, V, L, K, F, S, G, Y, SRG, or YQR; 

37 x 2 is H, R, Y, T, S, P, F ( N, A, K, G, or V; 

38 x 3 is I, L, M, T, V, E, G, F, or Y; 

39 X4 is R, S, G, H, A, K, or not present; 

40 x 5 is K, R, I, N, Q, A, IR, RQ, or not present; 

41 X6 is R, K, S, I, P, Q, E, G, or T; 

42 x 7 is T, K, V, I, a. R, P, or E; 

43 x 8 is N, NV, Y, KI, I, T, DK, H, or K; 

44 x 9 is N, S, K, E, Y, D, I, or Q; 
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45 x 10 is N, Y, S, D, G, or H; 

46 x u is P; 

47 x 12 is R, I, or K; 

48 x 13 is T, I, M or A; 

49 yj is R, K, Q, G f S, or T; 

50 y 2 is A, V, N, R, K, T, S, F, P, or W; 

51 y 3 is F, I, V, U W, Y, G, S, or T, 

52 y 4 is Y, V, H. U F, S, I T, M, R, VH, or FT. 

53 y 5 is T, A, V, Q, H, I, S, Y, or not present; 

54 y 6 is T, R, I, Q, A, M, or not present; 

55 y 7 is G, E, K, R, T, D, Q, A, H, N, P, or not present; 

56 y 8 is R, Q, E, K, D, N, A, G, S, I, or not present; 

57 y 9 is I, V, R, N ( G, or not present; 

58 yio is I, T, V, K, M, R, L, S, E, Q, A, or not present; 

59 y n is G, R, E, K, H, or not present; 

60 y 12 is D t N, I, R, T, S, or not present; 

61 yi3 is I, M, ME, U or not present; 

62 yi4 is R, G, K, S, E, or not present; 

63 yi5 is Q, K, or R; 

64 y l6 is A; and 

65 y 17 is H, Y, R, or Q, 

1 54. The tit, according to claim 51, wherein said compound(s) are selected from the group consisting 

2 of 

3 (a) a-Y-S-N-V-R-N-R-I-H-I-G-P-G-R-A-F-H-T-T-K-R-I-T-b; 

4 (b) a-N-N-N-T-S-R-G-I-R-I-G-P-G-R-A-I-l^A-T-E-R-M-b; 

5 (c) a-N-N-N-T-R-K-CI-F-I-G-P^J-R-N-I-Y-T-T-G-N-r-I-b; 

6 (d) a-N-TR-K-S-I-R-I-Q-R-G-P^.R-A-F-V-T-I-G-K-I-G-b; 

7 (e) a-N-N-N-T-R-K-R-(I or V)-T-M-G.P-G-R-V-(Y or W)-Y-(X or T)-(A or T>G-Q-M-b; 

8 (0 a-N-N-N-(I or T)-R-K-(R or S>I-T-(R or K)-G-P-G-{R or K)-V-I Y-A-T-G-Q-l-Lb; 

9 (g) a-N-N-N-T-R-K-G-I-Y-V-G^G^A or K)-V-T-T.R-(D or H or Q)-K-1-(I or M>b; and 
10 0») o-T-R-Q-(R or S)-T-P-I-G-L-G-Q-(A or S)-L-Y-T-T-R-b. 

1 55. An immunological assay for detecting and/or quantifying antibody against HIV in a fluid, said 

2 assay utilizing at least one compound, where said compound is not a naturally occurring HIV envelope 

3 protein, said compound having the capability of eliciting, and/or binding with, neutralizing antibodies, 

4 said compound comprising the principal neutralizing domain, or a segment thereof, of an HIV variant 



WO 90/03984 



PCT/US89/04302 



92 



1 56. The immunological assay, according to claim 55, wherein said compound has the formula: 

2 axGzGyb 

3 wherein x is 0 to 13 amino acids in length; 

4 y is 0 to 17 amino acids in length; and 

5 z is P, A, S, Q, or L; and 

6 either a or b, but not both, may be omitted; either a or b individually may comprise any one of the 

7 following: cysteine, a protein or other moiety capable of enhancing immunogeniciry, a peptide from 

8 an HIV principal neutralizing domain, a peptide capable of stimulating T-cells, or a general immune 

9 stimulant. 

1 57. A method for generating broad neutralizing polyclonal or monoclonal antibodies, said method 

2 comprising immunization with a mixture comprising compounds, where said compounds are not 

3 naturally occurring HTV envelope proteins, said compounds having the capability of eliciting, and/or 

4 binding with, neutralizing antibodies, where each of said compounds comprises the principal 

5 neutralizing domain, or a segment thereof, of an HTV variant. 

1 58. A method for generating broad neutralizing polyclonal or monoclonal antibodies, said method 

2 comprising immunization with a hybrid compound which comprises polypeptides having the capability 

3 of eliciting, and/or binding with, neutralizing antibodies, where each of said polypeptides comprises 

4 the principal neutralizing domain, or a segment thereof, of an HTV variant 

1 59. A method for generating broad neutralizing antibodies, said method comprising immunization 

2 or an animal or human with at least one compound comprising a principal neutralizing domain, or 

3 segment thereof, of an HTV variant followed by immunization of said animal or human with 

4 (a) a mixture comprising compounds, where said compounds are not naturally occurring HTV 

5 envelope proteins, said compounds having the capability of eliciting, and/or binding with, 

6 neutralizing antibodies, said compounds comprising the principal neutralizing domain, or 

7 a segment thereof, of an HTV variant; or 

8 (b) a hybrid compound which comprises polypeptides, where said polypeptides are not naturally 

9 occurring HTV envelope proteins, said polypeptides having the capability of eliciting, and/or 

10 binding with, neutralizing antibodies, said polypeptides comprising the principal neutralizing 

11 domain, or a segment thereof, of an HTV variant. 
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1 60. A method, according to claim 59, said method comprising immunization with at least one 

2 compound having the capability of eliciting, and/or binding with, neutralizing antibodies, said 

3 compound having the formula 

4 axGiGyb 

5 wherein x is 0 to 13 amino acids in length; 

6 y is 0 to 17 amino adds in length; and 

7 z is P, A, S, Q, or L; and 

8 either a or b, but not both, may be omitted; either a or b individually may comprise any one of the 

9 following: cysteine, a protein or other moiety capable of enhancing immunogenic! ty, a peptide from 

10 an HTV principal neutralizing domain, a peptide capable of stimulating T-cells, or a general immune 

11 stimulant. 

1 61. A method of detecting antibody against HTV in a biological fluid, comprising the steps of: 

2 (a) incubating a compound, other than a naturally occurring HTV envelope protein, said 

3 compound having the capability of eliciting, and/or binding with, neutralizing antibodies, said 

4 compound comprising the principal neutralizing domain, or a segment thereof, of an HIV 

5 variant; 

6 (b) detecting complexes formed between said antibody and said compound. 

1 62. The method, according to claim 61, wherein said compound has the following formula: 

2 lOiOy 

3 wherein x is 0 to 13 amino adds in length; 

4 y is 0 to 17 amino adds in length; and 

5 2 is P, A, S, Q, or L. 

1 63. A method for prophylaxis or treatment of HIV infection, said method comprising administering 

2 to an animal or human in need of such prophylaxis or treatment a composition comprising an antibody 

3 raised against a mixture comprising compounds, where said compounds are not naturally occurring 

4 HTV envelope proteins, but have the capability of ehdting, and/or binding with, neutralizing antibodies, 

5 each of said compounds comprising the prindpal neutralizing domain, or a segment thereof, of an HTV 

6 variant. 

1 64. The method, according to daim 63, wherein said compounds have the following formula: 

2 aiOiGyb 

3 wherein x is 0 to 13 amino adds in length; 

4 y is 0 to 17 amino adds in length; and 

5 z is P, A, S, Q, or L; and 
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6 either a or b, but not both, may be omitted; either a or b individually may comprise any one of the 

7 following: cysteine, a protein or other moiety capable of enhancing immunogenicity, a peptide from 

8 an HIV principal neutralizing domain, a peptide capable of stimulating T-cells, or a general immune 

9 stimulant. 

1 65. A method for stimulating a lymphocyte proliferative response in humans which comprises 

2 treating humans in need of stimulation of a lymphocyte proliferative response with at least one 

3 compound, said compound is not a naturally occurring HTV envelope protein, said compound having 

4 the capability of eliciting, and/or binding with, neutralizing antibodies, said compound comprising the 

5 principal neutralizing domain, or a segment thereof, of an HTV variant 

1 66. The method, according to claim 65, wherein said compound has the formula 

2 axGiGyb 

3 wherein x is 0 to 13 amino acids in length; 

4 y is 0 to 17 amino acids in length; and 

5 z is P, A, S, Q, or L; and 

6 either a or b, but not both, may be omitted; either a or b individually may comprise any one of the 

7 following: cysteine, a protein or other moiety capable of enhancing immunogenicity, a peptide from 

8 an HTV principal neutralizing domain, a peptide capable of stimulating T-cells, or a general immune 

9 stimulant. 

1 67. A method for treatment of HTV infection, said method comprising administering to an animal 

2 or human in need of such treatment a composition comprising one or more compounds wherein said 

3 compounds are not naturally occurring HTV envelope proteins, but have the capability of eliciting, 

4 and/or binding with, neutralizing antibodies, each of said compounds comprising the principal 

5 neutralizing domain, or a segment thereof, of an HIV variant. 

1 68. The method, according to claim 67, wherein said compounds have the formula 

2 ixGiGjb 

3 wherein x is 0 to 13 amino acids in length; 

4 y is 0 to 17 amino acids in length; and 

5 z is P, A, S, Q, or L; and 

6 either a or b, but not both, may be omitted; either a or b individually may comprise any one of the 

7 following: cysteine, a protein or other moiety capable of enhancing immunogenicity, a peptide from 

8 an HIV principal neutralizing domain, a peptide capable of stimulating T-cells, or a general immune 

9 stimulant. 
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1 69. A method for assaying a biological fluid for the presence of an HIV variant-specific protein, 

2 said method comprising 

3 (a) contacting said Quid with an antibody specific for compounds wherein said compounds are 

4 not naturally occurring HTV envelope proteins, but have the capability of eliciting, and/or 

5 binding with, neutralizing antibodies, said compounds comprising the principal neutralizing 

6 domain, or a segment thereof, of an HTV variant; and 

7 (b) detecting immune complexes as a measure of said variant in said fluid. 

1 70. The method, according to claim 69, wherein said compounds of part (c) have the formula 

2 ixOiOjb 

3 wherein x is 0 to 13 amino adds in length; 

4 y is 0 to 17 amino acids in length; and 

5 z is P, A, S, Q, or L; and 

6 either a or b, but not both, may be omitted; either a or b individually may comprise any one of the 

7 following: cysteine, a protein or other moiety capable of enhancing immunogenic! ty, a peptide from 

8 an HTV principal neutralizing domain, a peptide capable of stimulating T-cells, or a general immune 

9 stimulant 

1 71. A method of in vitro lymphocyte stimulation comprising treating lymphoid cells from immune 

2 animals with at least one compound, other than naturally occurring HTV envelope proteins, said 

3 compound having the capability of eliciting, and/or binding with, neutralizing antibodies, said 

4 compound comprising the principal neutralizing domain, or a segment thereof, of an HIV variant. 

1 72. The method, according to claim 71, where said compound has the formula 

2 axGzGyb 

3 wherein x is 0 to 13 amino acids in length; 

4 y is 0 to 17 amino acids in length; and 

5 i is P, A, S, Q, or L; and 

6 either a or b, but not both, may be omitted; either a or b individually may comprise any one of the 

7 following: cysteine, a protein or other moiery capable of enhancing immunogenicity, a peptide from 

8 an HIV principal neutralizing domain, a peptide capable of stimulating T-cells, or a general immune 

9 stimulant. 

1 73. A method of identifying polyclonal or monoclonal antibodies which are useful in prophylaxis 

2 or therapy, said method comprising screening antibody-producing cells for ability to bind with at least 

3 one compound, other than naturally occurring HIV envelope proteins, said compound having the 
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4 capability of eliciting, andTor binding with, neutralizing antibodies, said compound comprising the 

5 principal neutralizing domain, or a segment thereof, of an HTV variant 

1 74. The method, according to claim 73, wherein said compound has the formula 

2 axGzGyb 

3 wherein x is 0 to 13 amino acids in length; 

4 y is 0 to 17 amino acids in length; and 

5 z is P, A, S, Q, or L; and 

6 either a or b, bnt not both, may be omitted; either a or b individually may comprise any one of the 

7 following: cysteine, a protein or other moiety capable of enhancing immunogenicity, a peptide from 

8 an HTV principal neutralizing domain, a peptide capable of stimulating T- eel Is, or a general immune 

9 stimulant 

1 75. A synthetic gene which encodes for a polypeptide wherein said polypeptide comprises more than 

2 one HTV neutralizing epitope. 

1 76. A synthetic gene which codes for a polypeptide wherein said polypeptide comprises neutralizing 

2 epitopes from more than one HIV isolate. 

1 77. The synthetic gene, according to claim 76, wherein said polypeptide comprises neutralizing 

2 epitopes from 2 to 20 HTV isolates. 

1 78. The synthetic gene, according to claim 76, wherein one of said neutralizing epitopes is from 

2 the HIV isolate designated HIV-MN. 

1 79. The synthetic gene, according to claim 76, wherein said polypeptide comprises neutralizing 

2 epitopes from the HIV isolates which have been designated HTV-MN, HIV-IUB, HTV-RF, HTV-SC, 

3 and HJV-WMJ1. 

1 80. The synthetic gene, according to claim 76, wherein the regions encoding neutralizing epitopes 

2 from different HTV isolates are separated by amino acid spacers. 

1 81. The synthetic gene, according to claim 80, wherein said amino add spacers are glycines. 

1 82. The synthetic gene, according to claim 76, wherein said gene codes for a polypeptide having 

2 the amino acid sequence shown in Table 13, or an equivalent amino acid sequence. 
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1 83. The synthetic gene, according to claim 76, wherein said gene codes for a fusion polypeptide. 

1 84. The synthetic gene, according to claim 83, wherein said gene codes for a polypeptide having 

2 the amino acid sequence shown in Table 13A, or an equivalent amino acid sequence. 

1 85. A compound comprising neutralizing epitopes from more than one HIV isolate. 

1 86. The compound, according to claim 85, wherein said compound comprises neutralizing epitopes 

2 from 2 to 20 HIV isolates, 

1 87. The compound, according to claim 85, wherein one of said neutralizing epitopes is from the 

2 HIV isolate designated HIV-MN. 

1 88. The compound, according to claim 85, wherein said compound comprises neutralizing epitopes 

2 from the HIV isolates which have been designated HTV-MN, HTV-IIIB, HTV-RF, HIV-SC, and HIV- 

3 WMJ1. 

1 89. The compound, according to claim 85, wherein said neutralizing epitopes are separated by 

2 amino acid spacers. 

1 90. The compound, according to claim 89, wherein said amino acid spacers are glycines. 

1 91. The compound, according to claim 85, wherein said compound has the amino acid sequence 

2 shown in Table 13, or an equivalent amino add sequence. 

1 92. The compound, according to claim 85, wherein said compound is a fusion polypeptide. 

1 93. The compound, according to claim 92, wherein said compound has the amino add sequence 

2 shown in Table 13A, or an equivalent amino add sequence, 

1 94. A prophylactic or therapeutic composition, comprising immune globulin, monoclonal antibodies, 

2 and/or polyclonal antibodies generated by immunizing an appropriate animal with a composition 

3 comprising a multi-epitope compound wherein said multi-epitope compound comprises neutralizing 

4 epitopes from 2 to 20 HIV isolates, wherein one of said isolates is HIV-MN. 
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1 95. The composition, according to claim 94, wherein said multi-epitope compound comprises 

2 neutralizing epitopes from the HIV isolates designated HTV-MN, HTV-IIIB, HIV-RF, HIV-SC, and 

3 HIV-WMJ1. 

1 96. A method for generating broad neutralizing polyclonal or monoclonal antibodies, said method 

2 comprising immunizing an appropriate animal with a composition comprising a multi-epitope 

3 compound wherein said multi-epitope compound comprises neutralizing epitopes from 2 to 20 HIV 

4 isolates, wherein one of said isolates is HTV-MN. 

1 97. The method, according to claim 96, wherein said multi-epitope compound comprises neutralizing 

2 epitopes from the HTV isolates which have been designated HTV-MN, HIV-IIIB, HIV-RF, HIV-SC, 

3 and HIV-WMJ1. 

1 98. A method for prophylaxis or therapy of HTV infection, said method comprising administering 

2 to an animal or human in need of such prophylaxis or therapy a pharmaceutical composition 

3 comprising immune globulin, monoclonal antibodies, and/or polyclonal antibodies generated by 

4 immunizing an appropriate animal with a composition comprising a multi-epitope compound wherein 

5 said compound comprises neutralizing epitopes from 2 to 20 HIV isolates, wherein one of said isolates 

6 is HTV-MN. 

1 99. The method, according to claim 98, wherein said multi-epitope compound comprises neutralizing 

2 domains from the HTV isolates which have been designated HTV-MN, HIV-IIIB, HIV-RF, HIV-SC, 

3 and HTV-WMJl. 

1 100. A process for stimulating a lymphocyte proliferative response in humans which comprises 

2 treating humans in need of stimulation of a lymphocyte proliferative response with a composition 

3 comprising a multi-epitope compound wherein said compound comprises a neutralizing domain from 

4 2 to 20 HTV isolates, wherein one of said isolates is HTV-MN. 

1 101. The process, according to claim 100, wherein said multi-epitope compound comprises 

2 neutralizing epitopes from the HTV isolates which have been designated HTV-MN, HIV-IIIB, HIV- 

3 RF, HIV-SC, and HTV-WMJ1. 



WO 90/03984 



1/4 



PCT/US89/04302 



FIGURE 1 

Common Sequence Patterns 



KRKRIHtQPQ R A F Y T T K 

OCCURRENCE 

6 V - N H 



T S R G — R I L A - £ 



T Q - P N I Q 



Q R V'- I Q 



T y T M G 



T — g»TK R K V I -A - G 



5 T — g -y v — s — — r 

4 T -Q g TP - - L - Q £ L * 



SUBSTITUTE SHEET 



WO 90/03984 



2/4 



PCT/US89/04302 




SUBSTITUTE SHEET 



WO 90/03984 



3/4 



PCT/US89/04302 



FIGURE 3 
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FIGURE 4 - OLIGONUCLEOTIDES FOR CONSTRUCTION 
OF A MULTT-EPITOPE GENE 



HEO-1 

40 

5' CAGTCAAGCT TCCATGACGC GTATCCAGCG TGGTCCGGGT 

73 

CGTGCnTTG TTGGTGGCGG AGGCTCCATC ACC 3* 



40 

5' TAGAAGGCTC TTCCAGGTCC GATGTGAA1T CGACCCCCTC 

78 

CTCCGTAGAT AACCCTTCCT GGTCCCTTGG TGATGGAG 3' 



HEO-3 



40 

5' CGGTGGTGGA GGATCCATAC ATATAGGACC TGGAAG AG CA 

70 

TTTTATGGTG GAGGTGGTCA CATTCACATC V 



HEO-4 



5' GACCGAGCTC AG CAAATCG A TAGGCCCCCT CCGCCGTAGA 

67 

AAGCACGACC CGGACCGATG TGAATGT 3' 
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